Fig. 3d-e

[Chromatograms: A 280 and M NaCl plotted against fraction (no.); label: chymotrypsin]

Figure 4 

HtH1 cDNA sequence and intron structure
Domain a 

GGCTTGTTCAGTTTCTACTCGTCGCCCTTGTGGCGGGGGCTGGAGCAGACAACGTCGTCAG 
AA2VGGACGTGAGTCACCTCACGGA.TGACGAGGTGCAAGCTCTCCACGGCGCCCTCCATGAC 
G T C AC T G CAT C T AC AG G G C C T C T GAG T T T C G AAG AC AT AAC AT C T T AC CAT G C C G C AC C AG 
CGTCGTGTGACTACAAGGGACGGAAGATCGCCTGCTGTGTCCACGGTA.TGCCCAGTTTCCC 
CTTCTGGCACAGGGCATA.TGTCGTCCAAGCCGAGCGGGCACTGTTGTCCAAACGGAAGACT 
GTCGGAATGCCTTACTGGGACTG.GACGCAAACGCTGACTCACTTAXCA.TCTCTTGTGACTG 
AAC C CAT C T AC AT T G AC AG T AAAG G T G G AAAG G C T C AAAC C AAC TAX T G G T AC C G C G G C G A 
GAT AG C G T T C A.T C AAT AAG AAG AC T G C G C GAG C T G TAG AT GAT C G C C TAT T C GAG AAG G T G 
GAG C C T G G T CAC T AC AC AC AT C T T A.T G GAG AC T G T C C T C G AC G C T C T C GAAC AG G AC GAAT 
TCTGTAAATTTGAAATCCAGTTCGAGTTGGCTCATAATGCTATCCATTACTTGGTTGGCGG 
TAAATTTGAATATTCAATGTCAAACTTGGAATACACCTCCTACGACCCCA.TCTTCTTCCTC 
CACCACTCCAACGTTGACCGCCTCTTCGCCATCTGGCAGCGTCTTCAGGAACTGCGAGGAA 
AGAATCCCAATGCAATGGACTGTGCACATGAACTCGCTCACCAGCA.ACTCCAACCCTTCAA 
CAGGGACAGCAATCCAGTCCAGCTCACAJ^AGGACCA.CTCGACACCTGCTGACCTCTTTGAT 
TAC AAAC AAC T T G GA.T AC AG C T AC G AC AG C T T AAAC C T GAAT G GAAT G AC G C C AG AAC AGC 
TGAAAACAGAACTAGACGAACGCCACTCCAAAGAAGGTGCGTTTGCAAGCTTCCGACTCAG 
TGGCTTTGGGGGTTCTGCCAACGTTGTTGTCTATGCATGTGTCCCTGA.TGATGATCCACGC 
AGTGATGACTACTGCGAGAAAGCAGGCGACTTCTTCATTCTTGGGGGTCAAAGCGAAATGC 
CGTGGAGATTCTACAGACCCTTCTTCTATGATGTAACTGAAGCGGTACATCACCTTGGAGT 
CCCGCTAAGTGGCCACTACTA.TGTGAAAACAGAACTCTTCAGCGTGAATGGCACAGCA.CTT 
TCACCTGATCTTCTTCCTCAACCAACTGTTGCCTACCGACCTGGGAAAG 



Domain b 

GTCACCTT G AC C CAC C T G T G CAT CAT C G C CAC GAT G AC GAT C T TAT T G T T C G AAJ^AAAT AT 
AGAT CAT T T G AC T C G T G AAG AG GAAT AC GAG C T AAG GAT G G C T C T G GAG AG AT T C C AG G C C 
GACACATCCGTTGATGGGTACCAGGCTACAGTAGAG.TACCATGGCCTTCCTGCTCGTTGTC 
CACGACCAGATGCAAAAGTCAGGTTCGCCTGTTGTATGCATGGCATGGCATCCTTCCCTCA 
CTGGCACCGGCTGTTCGTTACCCAGGTGGAAGA.TGCTCTTGTACGGCGTGGATCGCCTATC 
GGTGTTCCTTATTGGGACTGGACAAAACCTATGACTCACCTTCCAGACTTGGCATCAAATG 
AGAC G TAC G TAG AC C C G TAT G G AC ATAC AC AT CAT AAT C CAT T C T T C AAT GC AAAT AT AT C 
T T T T GAG GAG G G AC AC CAT CAC AC GAG C AG GAT G AT AGA.T T C G AAAC TGTTTGCCC CAG T C 
GCTTTTGGGGAGCATTCCCATCTGTTTGATGGAATCCTGTACGCATTTGAGCAGGAAGATT 
TCTGCGACTTTGAGA.TTCAGTTTGAGTTAGTCCA.TAATTCTA.TTCATGCGTGGA-TAGGCGG 
TTCCGAAGATTACTCCATGGCCACCCTGCATTACACAGCCTTTGACCCCATTTTCTACCTT 
CAT CAT T C C AAT G T C G A.T C G T C TAT G G G C AAT C T G G C AAG C T C T T C AAAT CAG GAG AC AC A 
AGCCATATCAAGCCCACTGTGCA.CAGTCTGTGGAACAGTTGCCAATGAAGCCATTTGCTTT 
C C CAT C AX C T C T T AAC AAC A-AC GAG AAG AC AC AT AG T C A_T T CAG T C C C G AC T G AC AT T TAT 
G AC TAC GAG G AAG T G C T G CAC TAC AG C TAG GAT G AT C T AAC GTTTGGTGG GAT GAAC C T T G 
AAG AAAT AG AAG AAG C TAT AC AT C T CAG AC AAC AG CAT GAAC GAG TCTTCGCGG GAT T T C T 
CCTTGCTGGAATAGGAACATCTGCACTTGTTGACATTTTCATAAATAAACCGGGGAACCAA 
CCACTCAAAGCTGGAGATATTGCCATTCTTGGTGGTGCCAAGGAAATGCCTTGGGCGTTTG 
ACCGCTTGTATAAGGTCGAAATAACTGACTCATTGAAGACACTTTCTCTCGA.TGTCGATGG 
AGAT TAT G AAG T CAC T T T T AAAAT T CAT GAT AT G C AX G G AAAC G C T C T T GA.T A C G G AC C T G 
ATTCCACACGCAGCAGTTGTTTCTGAGCCAGCTCACC 
Intron b/c

GTAAGTAAATTTACAAAATTTGGTGTTCTCTAACTATCCTAAGTATTCAATCGTTAGCGTG 
T AC C TAT C T G C AT AAT G CAATAC C C T G AC T C C AT AT AAG TAT AG TAT AT T T AC TCTGGTCG 
AAAACAAACAAATTGAAAACAAGAGTGGACGTGCTGTTATGATTTCTTTTTCATTCTTGGT 
TCGTTGTGTAATGCCACAGCCAGCAATTCCAGATA.TATAG"CGACGGTCTATGAATACTCCA 
GTCTGGACCAGACAATCGTGTGGAATGGTTTAGGCACATTATATCAAATTCATTGTTGAAG 
ATATGAGTTATGAGGTCACAATGTTGTCTTGTTACCCCGTGTCAGTAGTGACGTCATTTCA 
T G AC T GAAAT C T C T T C AAC G C C G T T TAG C AATAAT AG G C T C AG TAG TAT T C AAC C AAT T AC 
AAT C AG T AG AAAAT T C T C TAT AC TAT T C T TAT G T T G CAT C C T GAT AT C C C TAT G C AAAAAT 
TAG T CAT C T AAT AT AAT CAT T T T C G AT AAAT AC T T T G G G CAAACAAAT C AAT G T AAC AT C T 
ATTTTCTTTCAG 



Domain c 

C T AC C T T T GAG GAT G AAAAG C AC AG C T T AC GAAT C AGAAAAAAT G T C G AC AG C T T GAC T C C 
TGAAGAAACAAATGAACTGCGTAAAGCCCTGGAGCTTCTTGAAAATGATCATACTGCAGGT 
GGATTCAATCAGCTTGGCGCCTTCCATGGAGAGC.CTAAATGGTGCCCTAATCCTGAAGCGG 
AGCACAAGGTTGCATGCTGTGTTCATGGCATGGCTGTTTTCCCTCATTGGCACAGGCTTCT 
TGCTCTCCAGGCGGAGAATGCTCTTAGAAAGCATGGGTACAGTGGTGCTCTACCATACTGG 
GATTGGACTCGCCeCCTTTCCCAACTTCCTGATCTGGTTAGTCATGAGCAGTA.TACAGATC 
C T T C C GAC CAT C AC G T G AAG CAT AAC CCGTGGTT CAAT G G C C AC AT C G AT ACAG T AAAT C A 
G GAT AC C AC C AG AAG C G T AC G G GAG GAT C T T TAT C AAC AAC C T GAAT T T G GAC AT T T C AC G 
GATATTGCTCAACAAGTCCTCTTAGCATTAGAACAAGATGACTTCTGTTCGTTTGAAGTGC 
AG TAT GAG AT T T C C CAT AAT T T TAT C CAT G C AC T T G TAG GAG G AAC C GAC G C T TAT G G CAT 
G G C AT C G C T GAG AT AT AC AG CAT AC GAT C CAAT CTTTTTCTTG CAT CAT T C AAAC AC C GAC 
AGGATCTGGGCTA-TTTGGCAATCCCTGCAAAAATACAGAGGCAAACCGTACAACACTGCCA 
AC T G C G C C AT AGAAT C TAT GAG AAG G C C C C T G C AAC CAT T T G GAC TAAGC AG T G C CAT T AA 
C C CT GAC AG AAT C AC C AG AG AG CAT G C TAT C C C G T T T G A_T G T C T T C AAC T AT AG AGATAAC 
C T T CAT T AC G TAT AT GAT AC C C T G GAAT T T AAT GGTTTGTC GAT T T C A_C AAC T T GAT AG AG 
AGCTGGAAAAAATCAAGAGTCACGAAAGAGTATTTGCTGGATTCTTGCTGTCGGGGATTAA 
AAAATCTGCTCTTGTGAAATTCGAAGTTTGTACTCCACCTGATAATTGTCATAAAGCAGGG 
GAGTTTTATCTACTCGGGGACGAAAACGA.GATGGCTTGGGCCTA.TGACCGACTTTTCAAGT 
AT GAT AT T AC T C AG G T T C T G G AAG C AAAC CAT C T AC AC T T C T A.T GAT CAT C T C T T CAT T C G 
CTACGAAGTCTTTGATCTTAAAGGAGTGAGTTTGGGAACTGACCTGTTCCACACTGCAAAT 
GTGGTACATGATTCCGGCACAG 

Intron c/d 

G T AC G T G GAT T T GAT T AC AT AG CAAT G C TAT AT GAT T T C AG T AAT T AC AAC C T C AAG T CAT 
G TAG C C G T T T TAG AT T G CAT T AC AT C AAAC AG CAT T G GAT T AAAT T G G G G GAT T G T C C AG G 
CCGCATTATGTTGCATTCCGAAAATAGTTTGTGTCCAGTGTCCACGTTTAAAATTAAACCA 
T T T T AAT CAT AT TAG G GAT AAT T T T AAT AG AT G T TAT AG T G C T T TAT T T C A.T AT T G T T AC A 
G T G GAC AG T C AC C AAG GAC AT AT T T T AC T C TAT AG AT AC AC AAAC AC CAAT TAAAAC C C T G 
CTTTGGAAAGTCTAJVCTTTTTCCCCACAG 



Domain d 

G C AC C C G T GAT C G T GAT AAC T AC G T T G AAG AAG T T AC T G G G G C C AG T CATAT C AG G AAG AA 
TTTGAACGACCTCAATACCGGAGAAATGGAAAGCCTTAGAGCTGCTTTCCTGCATATTCAG 
GAC GAC G G AAC AT AT GAAT C TAT T G C C C AG T AC C AT G G C AAAC C AG G C AAAT G T CAAT T G A 
ATGATCATAATATTGCGTGTTGTGTCCATGGTATGCCTACCTTCCCCCAGTGGCACAGACT 
GTATGTGGTTCAGGTGGAGAATGCTCTCCTAAACAGGGGATCTGGTGTGGCTGTTCCTTAC 
TGGGAGTGGACTGCTCCCATAGACCATCTACCTCATTTCATTGATGATGCAACATACTTCA
ATTCCCGACAACAGCGGTACGACCCTAACCCTTTCTTCAGGGGAAAGGTTACTTTTGAAAA 
C G C AG T C AC AAC AAG G G AC C C AC AAG CCGGGCTCTT C AAC T C AG AT TAT AT G TAT G AG AAT 
GTTTTACTTGCACTGGAGCAGGAAAATTATTGTGACTTTGAAATTCAGTTTGAGCTTGTTC 
ATAACGCACTTCA.TTCCATGCTGGGAGGTAAAGGGCAGTACTCCATGTCCTCCCTGGACTA 
TTCTGCGTTTGATCCCGTCTTCTTCCTACATCATGCCAACACGGACAGACTGTGGGCAATC 
T G G C AGG AAC T AC AAAG AT T C C G AGAAC T G C C T TAT G AAG AAG C G AAC T G T G C AAT C AAC C 
T CA.T G CAT C AAC C AC T G AAG C C G T T C AG T GAT C C AC AT GAG AAT C AC G AC AAT G T C AC T T T 
G AAAT AC T C AAAAC C AC AG G AC G GAT T C G AC T AC C AG AAC C AC T T C G G A.T AC AAG T A.T G AC 
AACCTTGAGTTCCATCACTTA.TCTATCCCAAGTCTTGATGCTACCCTGAAGCAAAGGAGAA 
ATCACGACAGAGTGTTTGCGGGCTTCCTTCTTCATAACA.TAGGAACTTCTGCTGACA.TAAC 
TA.TCTACATATGTCTGCCTGACGGACGGCGTGGCAATGACTGCAGTCA.TGAGGCGGGAACA 
TTCTATATCCTCGGAGGCGAAA.CAGAGATGCCTTTTATCTTTGACCGTTTGTATAAATTTG 
AAAT C AC C AAAC C AC T G C AAC AG T T AG GAG T C AAG C T G CAT G G T G GAG T T T T C GAAC T G G A 
GCTTGAGA.TCAAGGCATACAACGGTTCCTATCTGGATCCCCATACCTTTGATCCAACTATC 
ATCTTTGAACCTGGAACAG 

Intron d/e 

G T AAT G C CA.T C T T AAT AC AG TTCGTTCGT T AAAT T A.T AT AT G T T C G T T T AC AAC AC CAT AC 
CTTGAATTGAGGTAATACATCACTTGATA*TTGATAATGTAATGGTAATTGTTCTTGTTTGT 
AAAACCGTTTCTGGGGTGTTTATTCACT.ATCCACCTGGTGGATAGTGAGTAAACACATTCG 
GTTTAATATGGGTATCTAATGGACAGTGAAGTGTGCTGGCTAGGCAGATA.CCTTGGTTTCT 
GTGAATGGAGGTAGTAGAAAGGGGTTTTGATGATTGCAG 



Domain e 

AT AC C CAT AT C T T G G AC C AC G AC CAT GAG G AAG AG A.T AC T T G T C AG G AAG AAT A.T AAT T G A 
TTTGAGCCCAAGGGAGAGGGTTTCTCTAGTCAAAGCTTTGCAAAGAATGAAGAATGATCGC 
TCCGCTGATGGGTACCAAGCCATTGCCTCTTTCCA.TGCCCTGCCACCACTCTGTCCCAATC 
CA.TCTGCA-GCTCACCGTTATGCTTGCTGTGTCCA.TGGCATGGCTACATTTCCCCAGTGGCA 
CAGACTGTACACTGTTCAGGTTCAGGATGCCCTGAGGAGACATGGTTCACTTGTTGGTA.TT 
CCTTACTGGGACTGGACAAAACCAGTCAACGAGTTACCCGAGCTTCTTTCTTCAGCAACA.T 
T TTA.T CAT C C AAT C C G GAATAT T AAT AT T T C AAAT C CAT TCCTCGGGGCT G AC A.T AG AAT T 
T G AAG G AC CGGGCGTT CAT AC AG AG AG G C AC AT AAAT A.C T GAG CGCCTGTTT C AC AG T G G G 
GATCATGACGGATACCACAACTGGTTCTTCGAAACTGTTCTCTTTGCTTTGGAACAGGAAG 
AT T AC T G C G A.T T T T G AAA.T AC AAT T T GAG A.T AG C C CAT AAT G G CAT C C AC AC AT G GAT T G G 
T G G AAG C G C AG T A.T AT G G CAT G G G AC AC C T T C AC TAT G CAT CA.T A.T G A.T C C AAT T T T C T AC 
A.T C C AC CAT T C AC AG AC G G AC AG AAT A.T G G G C TAT T T G G C AAG AG C T G C AG AAG T A.C AG G G 
GTCTATCTGGTTCGGAAGCAAACTGTGCCATTGAACATATGAGAACACCCTTGAAGCCTTT 
C AG CTTTGGGC C AC C C T AC AAT T T G AAT AG T CAT AC G C AAG AAT AT T C AAAG C C T GAG G A.C 
A.C G T T T G AC TAT AAG AAG T T T G GAT AC AG AT A.T GAT AG T C T G G AA.T T G GAG G G G C GAT C AA 
T T T C T C G CA.T T G.AT G A-AC T T A.T C C AGC AG AG AC AG GAG AAAG AC AG AAC T T T T G C AG G G T T 
CCTCCTTAAAGGTTTTGGTACATCCGCA.TCTGTGTCATTGCAAGTTTGCAGAGTTGATCAC 
ACCTGTAAAGATGCGGGCTATTTCACTATTCTGGGAGGA.TCAGCCGAAATGCCA.TGGGCA:T 
T C G AC AG G C T T TAT AAG T A.T G AC AT T AC T AAAAC T C T T C AC G AC AT GAAC C T GAG G C AC G A 
G G AC AC T T T C T C TAT AG AC G T AAC TAT C AC GTCT T AC AAT G GAAC AG T AC T C T C G G G AG A C 
CTCATTCAGACGCCCTCCATTATATTTGTACCTGGACGCC 

Intron e/f 

GTGAGTACCTGTTTGCACTAAGACTTCTGTAGGCTAAAAGTGTAAGAAATATCAATTAATT 
TCAATTCACCCAAACTTGAAAACGGTACCTATATAGGTTAACTTTTTGTCTACAGTAAACT 
GAAC AT AC C T AC ACA.T T T C AT G AAAT G A.T C T C T C AAT AT T T T C C AC C AAC AG 
Domain f

ATAAAC T C AAC T C AC G GAAAC AT AC AC C T AAC AG AG T C C G C CAT GAG C TAAG TAG C C T TAG 
T T C C C G T G AC AT AG CAAG C T T GAAG G C AG C T T T G AC AAG C C T T C AAC AT G AT AAT G G G AC T 
GATGGTTATCAAGCTATTGCTGCCTTCCATGGCGTTCCTGCGCAGTGCCACGAGCCATCTG 
GACGTGAG 

Intron f (1) 

G T AAAT T T AC AG AG C T T TAT GAAG T G T G T T C AG AG T GAAG AG AC CAAG AT AT AC T TAT AC C 
C AAAAC TAG C TAG C AAC AG AC GAT T T C A C TTGTTTCG G A C A C T T T G TAT TAT AC G T T G GAT 
CCCAAGGTAAACGGAAACGTAACCGAGAATCAGTCCGTAAAGTGAGTGAGTGAGTTTGGGG 
CTTAACGTCGCACTCAGCAATACCCCAGCTATGTGGCGACTCTCAGATTTACTGCTGGAGG 
AGAACCTACATAGCCCGGTTTAACCCGTGTGGTATGTAGTAAGACCAGCGCGGCATGGCTG 
GTATCTGACGGACGAAGGGTGGCGCTGCAXGTATTCCAGTGGTACAACACTGCACCCCAAT 
TTCACCGACCGGAGAACTGATCTCCCCTTCGGAGATATCGCCTGCCTTCCACGGGATTCGA 
ACT-CGGTGACCTTCAAGCCAGCGCGCTTCTAGCGGGGGCGATTAGAGGTTNAAGGCCGACG 
GCTCTACCACCTTAACTATCCCCCGGCCCCACTCCTGACGGAAATGTTTATAATTCAGCCT 
TTGTTTTCT TAT T AAAC AC T C T T G G C AG AT T T T C TAT AG AT AAT G GAT T C AC AT G TAG AC A 
GTCTCCCATTGTTGTAACTGGTAGTCAAGAGTTAGAATCTGAATACATTCTCCAAGATGGA 
T CAAG G AAAAC AAT AAT TAG T T GAT G T T G C AG 



Domain f (2)

ATCGCCTGTTGCATCCACGGCATGGCGACGTTTCCTCACTGGCACCGGTTGTACACTCTGC 
AGTTGGAGCAAGCGCTGCGCAGACACGGGTCCAGTGTTGCTGTTCCATACTGGGACTGGAC 
CAAG C C AAT C AC C G AAC T G C C AC AC AT T C T G AC AG AC G GAG AAT AT TAT G AC G T T T G G C AA 
AATGCCGTCTTGGCCAATCCGTTTGCAAGAGGTTATGTGAAAATTAAAGATGCATTTACGG 
TGAGAAATGTCCAGGAAAGTCTGTTCAAAATGTCAAGTTTTGGAAAGCACTCGCTTCTGTT 
TGACCAGGCTTTGTTGGCTCTTGAACAAACTGACTACTGTGACTTCGAAGTTCAGTTTGAA 
GTGATGCATAACACGATCCATTATCTCGTAGGAGGGCGTCAAACGTACGCCTTCTCCTCTC 
T C GAG TAT T C C T CAT AC GAT C C AAT C T T C T T TAT T C AC C AC TCGTTTGTT G AC AAAAT AT G 
G G C T G TAT G G CAAG AAC T G C AAAG C AG GAG AC AT C T A.C AG T T TAG AAC AG C T GAT T G T G C T 
GTGGGCCTCATGGGTCAGGCAATGAGGCCTTTCAACAAGGATTTCAACCACAACTCGTTCA 
C CAAG AAG C AC G C AG T C C C T AAT AC AG TAT T T GAT T A.T G AAGAT C T T G G C TAT AAC TAT G A 
C AAC C T T G AAAT C AG T G G T T T AAA C T T AAAT GAG AT C GAG G C G T T AAT AG C AAAAC G C AAG 
TCACATGCTAGAGTCTTTGCTGGGTTCCTGTTGTTTGGATTAGGAACTTCGGCTGATATAC 
ATCTGGAAATTTGCAAGACATCGGAAAACTGCCATGATGCTGGTGTGATTTTCATCCTTGG 
AG G T T C T G C AG AG AT G CAT T G G G CAT AC AAC C G C C T C T AC AAG TAT G A C A.T T AC AG AAG C A 
TTGCAGGAATTTGACATCAACCCTGAAGATGTTTTCCATGCTGATGAACCATTTTTCCTGA 
GGCTGTCGGTTGTTGCTGTGAATGGAACTGTCATTCCATCGTCTCATCTTCACCAGCCAAC 
GAT AAT C TAT GAAC C AG G C GAAG 

Intron f/g 

G T GAG AT AT AT G C AAAT T G AA T G T T G T C C AG AT GCGTTGTT T AC AT T TAT AT G C T T G G AAT 
T G T C C T GAAC G AAT AC AG T G G AAT AAC CAAAAG C T G AAAAAT AAAAAG AT AT AT AC T T CAT 
T C T GAAT T T G T C AG TAT T G C T G AC C CAAAAACAC G T TAT C CAT G T C G AC AC TAT AT T T G C C 
T T T C T GAAT C T GAG AC T G C G T TAT G T T T C T AAT AAT C AC G AAAT AT G G TAT AC AG G T T G T G 
TAT C T G TAG AAT AC C CAAG G C AG AAT T T AAAG G G T C AC AC C C T G T T T AAT AC AG 
Domain g (1)

AT C AC CAT'GAC G AC CA.T C AG T C G G GAAG CAT AG C AG GAT CCGGGGTCCG CAAG G AC G T GAA 
CA.CCTTGACTAAGGCTGAGACCGACAACCTGAGGGAGGCGCTGTGGGGTGTCATGGCAGAC 
CACGGTCCCAATGGCTTTCAAGCTATTGCTGCTTTCCATGGAAAACCAGCTTTGTGTCCCA 
TGCCTGATGGCCACAACTACTCATGTTGTACTCACGGCATGGCTACCTTCCCACACTGGCA 
TCGCCTCTACAXCAAGCAGATGGAGGATGCAATGAGGGCGCATGGGTCTCATGTCGGCCTG 
CCCTACTGGGACTGGACTGCTGCCTTCACCCACCTGCCAACACTGGTCACCGACACGGA.CA 
AC AAC C C C T T C C AAC AT 

Intron g (2)

GTAAGAGCGGGGTAGGGATGGGGTGGTAGGGGGTGGGTTGTTCTA.TTACTTCCCGCTTCAC 
TTGTATGAAATGGATAACCTTGGCTGCATCCCAATTGCGTGATCGATTCTCTTTCGA.TTCA. 
CTCGTGCGATTAGACTGCCTTA.TTTACTATAGTAGTTAGAATGTTGCTCAGTGCGCCGTTA 
AAC AAC T AAT AC AC AAAAC C G CAT T T G T T T TAT AT G G T C AC T C T AC T G T T TAT C AC G TATA 
TGTATGTTCCGACTCACTGGTTGGTGCGTACCATTCTACTGTCACACTGAGAGCCAATGTT 
C T C AG AT G T G T G AAAT G T T T GAAAG C C G T T T C TACATAATAT T G C AG GA^ATAC CAT T G TAG 
AATGTAGTCAAACAGGTAACAATCTGTTAGTGAGCCCAGTTCGAGGTTGCGTTGTAGGGTG 
TAGTCCAACAGGTA.GGCAGTCCATAAGCATAGTTTTTAAGCATTTTAGATCATCTATAATT 
AAC C AC AT G G T TAG C C G C TAT G T T TAG T T T AAT C C AG T AT AAG T TAG AAC T G T T ATA.T T T C 
GAAG G GAAG T GAG TAAAT C C T T A.T T C C T T G AC T AC CAT T T AAT AG AT T T C C C AAT G AC T C C 
ATTCAACTCCTAACTTTCACATCACTGCTCTCTTCAACAG- 



Domain g (2)

G G AC AC AT T GAT TAT C T C AAT G T C AG C AC AAC T C GAT C T C C C C GAG AC AT G C T G T T C AAC G 
ACCCCGAGCATGGATCAGAGTCGTTCTTCTACAGACAAGTCCTCTTAGCTCTGGAACAAAC 
TGATTTCTGCAAATTCGAAGTTCAGTTTGAGATAACCCACAATGCCATCCATTCCTGGACA 
GGTGGCCACAGCCCCTACGGAATGTCCA.CTCTCGACTTCACTGCCTACGATCCTCTCTTCT 
GGCTTCACCACTCCAACACCGACAGAAT.CTGGGCTGTCTGGCAAGCTTTGCAAGAATACAG 
AG G AC T T C CAT AC AAC CAT G C C AAT T G T GAG A.T C C AGG C AAT G AAAAC GCCCCT GAG GCCT 
T T C AG T G AC GAT AT C AAC C AC AAC C C AG T C AC AAAG G C T AAC G C GAAG C CAT T AGA.T G T G T 
T C GAG TAT AAT C G G T T GA.G C T T C C AG T AC G AC AAC C T CAT C T T C CA.T G GAT AC AG TAT T C C 
GGAACTTGATCGCGTGCTTGAAGAAAGAAAGGAGGAGGACAGAATATTTGCTGCCTTCCTT 
CTCAGTG G AAT CAAG C G TAG T G C T GAT G TAG T G T T C G AC A.T A.T G C C AG C C AG AAC AC G AAT 
GTGTGTTCGCAGGGACTTTTGCGATTTTGGGAGGGGAGCTAGAAATGCCCTGGTCCTTCGA 
C AG AC TGTTCCGC TAT GAT AT C AC CAAG G T GAT GAAG C A G C T AC AC C T GAG G CAT G AC T C T 
GACTTTACCTTCAGGGTGAAGATTGTCGGCACCGACGACCACGAGCTTCCTTCAGACAGTG 
T C AAAG C AC C AAC TAT T G AAT T T G AAC C G G G C G 

Intron g/h 

GTGAGTACGACAGGCATTTCTAGTAAAAACCTACTTTTGGTAAAAGGTTCGAGAAATCACT 
T GAAG C AAC AAC AT GAT T T T G T AAC G C C TAT T AC AC G T G AAC A.T G T C AC AC C C G G T G A.T G C 
CGTTTAATGGACATGCCTCTGTTAATGAAAGGGGTAAGTACA.TGTGTATGGGGATGGGA.TG 
GGAGCCACCTGTCCCAATTTCA.TAGGTCCCTAGGATCCCAGTTGCGTAGGAATCCCCTGA.T 
TAATGCCTTGTGAATTCCTCCTGGA-ATTGTCCTGGCCCAAATTTTTACAAACCCGCCCCGA 
TATACCTTGGAAATAATTGGGCCTAAGGGTGGGGCTTTTAAGGACCAAGAACCCAACCTA 21 . 
ACCCCAACCCATTTTTTCCCACCCA.TTCCAGGTTTTGTTTTACCAAATAAAAAGGTTTCCA 
CTTTGAGGAAACCCTTTAAGGGTTCTTTTCAGGGCTTTTTTTCTTTTCTGGGAA.TTCCAAT 
T C C G G G G G AACAAAAT AC A.T AT A.T T T C AC AG AC C T T T G G T C AAAT T TAT AT AAT T T C C G AC 
TTCATGTCA/TAGGTTTGTCTTTCTTCCTACACAG 
Domain h

TGCACAGAGGCGGAAACCACGAAGATGAACACCATGATGACAGACTCGCAGATGTCCTGAT
CAGGAAAGAAGTTGACTTCCTCTCCCTGCAAGAGGCCAACGCAATTAAGGATGCACTGTAC 
AAGCTCCAGAATGACGACAGTAAAGGGGGCTTTGAGGCCATAGCTGGCTATCACGGGTATC 
CTAATATGTGTCCAGAAAGAGGTACCGACAAGTAJTCCCTGCTGTGTCCACGGAATGCCCGT 
GTTCCCCCACTGGCACCGCCTGCATACCATTCAGATGGAGAGAGCTCTGAAAAACCATGGC 
TCTCCAATGGGCA.TTCCTTACTGGGATTGGACAAAGAAGA.TGTCGAGTCTTCCATCTTTCT 
TTGGAGATTCCAGCAACAACAACCCTTTCTACAAATATTACATCCGGGGCGTGCAGCACGA. 
AACAACCAGGGACATTAJVTCAGAGACTCTTTAATCAAACCAAGTTTGGTGAATTTGAJTAC 
CTA.TATTACCTAACTCTGCAAGTCCTGGAGGAAAACTCGTACTGTGACTTTGAAGTTCAGT 
ATGAGATCCTCCATAACGCCGTCCACTCCTGGCTTGGAGGAACTGGAAAGTATTCCATGTC 
TACCCTGGAGCA.TTCGGCCTTTGACCCTGTCTTCA.TGATTCACCACTCGAGTTTGGATAGA 
ATCTGGATCCTTTGGCAGAAGTTGCAAAAGATAAGAATGAAGCCTTACTACGCATTGGATT 
GTGCTGGCGACAGACTTATGAAAGACCCCCTGCATCCCTTCAACTACGAAACCGTTAATGA 
AG AT G AAT T C AC C C G CAT C AAC T C T T T C C C AAG CAT AC T G T T T G AC C AC T AC AG G T T C AAC 
TATGAATACGATAACA.TGAGAATCAGGGGTCAGGACATACATGAACTTGAAGAGGTAATTC 
AGGAATTAAGAAACAAAGATCGCATATTTGCTGGTTTTGTTTTGTCGGGCTTACGGATATC 
AG C T AC AG T G AAAG TAT T CAT T CAT T C G AAAAAC GAT AC AAG T C AC G AAG AAT AT G C AG G A 
GAATTTGCAGTTTTGGGAGGTGAGAAGGAGATGCCGTGGGCATATGAAAGAATGCTGAAAT 
TGGACATCTCCGATGCTGTACACAAGCTTCACGTGAAAGATGAAGACATCCGTTTTAGAGT 
GGTTGTTACTGCCTACAACGGTGACGTTGTTACCACCAGGCTGTCTCAGCCATTCATCGTC 
CACCGTCCAGCCCATGTGGCTCACGA.CATCTTGGTAATCCCAGTAGGTGCGGGCCATGACC 
T T C C G C C T AAAG T C G T A.G T AAAG AG C G G C AC C AAA G T C GAG T T T AC AC C AAT AG AT T C G T C 
GGTGAACAAAGCAATGGTGGAGCTGGGCAGCTATACTGCTATGGCTAAATGCATCGTTCCC 
CCTTTCTCTTACCACGGCTTTGAACTGGACAAAGTCTACAGCGTCGATCACGGAGACTACT 
ACATTGCTGCAGGTACCCACGCGTTGTGTGAGCAGAACCTCAGGCTCCACATCCACGTGGA 
ACACGAGTAG 



3' UTR
TTCACAG 
Intron UTR 

G T GAG GAG AAG G C C C C AG G C TAG C AG G G C AAT G GAT G AAG G AAAT AG G G G C AAAG G G AAT A 
G C AG T T AC AC CAT C G AC AT T T C C AAC CTCCT C AG AAAC T AAT AT AT AG C C T T AAT AC AAC C 
AGCCAAGACTCAACGGGCAGCCGGGGTGGGGGGATTTGGTGGTCGCTGTTTCAGACCAGGG 
T G C AAAAT A.T C A.G T G C G C AAAT C AAC AT GTTGCGTGT C A.G AC AC T G AC AC AG C AG T CAT T G 
AAC C T G C AGAC C C A.T AAC AG G AAAAT G G G G C AGAT AC GAT C AAAG AC AG T G T AAAAT AG GG 
AT AAG TAG G CAT AT G C AAC C AC C T GAT G G AAAT G AAAA G G G G T AJVG T T T AAAC C C C G G C T A 
CCAAAGGTCCAATGGTTCCTTAACCCAGCTTACGCTATCCCTCTAATTTCAGTATTGAGCT 
GAT T T C T G T C GAG T T C AT G T AAAC T G TAT AC T T T C T G TAT TAT T AC AG 

3' UTR

G T T G C TAT G C C G AC T G C G C TAT AT T G G T GAAC GAG AC G A.T GAG G AC AT C T C T G AAAG AG T T 
C G C C AAG T GAT G T G TAG G T C AC G G AAG TAT T G T T GAG C T AAC AAT AT GAT GAT T T C AAAAT 
GACTTGGCGCTCTAGGACAAAGACATAATTCATCAGCACCCTGTGCACCAACTCTTTGTTT 
GCTGCAAACGTCTGACAAGCGACA.CGTCAATCAACAAGCTGTTCAAACTCAAGTGGATGTA 
AC TA.GAAT CGTTGGGC CAT C G T T C AC AAAG TAT T G AC AG AT G T C AC ACAT GAT G G C GAGAA 
AC AC T T T AGAAC T T T T AAT G AC C TAG AG T G AX T T G T AAAT AT G T AAAT AT AT T C T T C AAAG 
AC T C AG C T G AAC T AT T G T T G G AT AAC AC AT C AAT T C C C T C AAC AAAAT G C T T T AT C T T C AC 
AT G GAT G TAT G T AAT GTGGCCGG C AAT AAAG TAT AT AT AT G T AT AAAAAAAAAAAAAAAAA 
A 
Figure 5



Derived primary structure of HtH1



Signal peptide 
LVQFLLVALVAGAGA 
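
The derived primary structure can be checked against the cDNA of Figure 4 by translating the coding sequence with the standard genetic code. The following is a minimal sketch, not the procedure used for the figure. The helper names `clean` and `translate` and the `offset` parameter are assumptions introduced here for illustration. Note that `clean` simply discards any character other than A, C, G or T, so a line in which an unreadable character stands for a real base would shift the reading frame.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; amino acids listed in TCAG codon order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop whitespace and every character that is not A, C, G or T."""
    return "".join(ch for ch in seq.upper() if ch in "ACGT")


def translate(seq, offset=0):
    """Translate a cleaned nucleotide string, starting at the given offset."""
    seq = clean(seq)[offset:]
    # Step through complete codons only; a trailing partial codon is ignored.
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First cDNA line of Figure 4 (Domain a); the reading frame starts at the third base.
first_line = "GGCTTGTTCAGTTTCTACTCGTCGCCCTTGTGGCGGGGGCTGGAGCAGACAACGTCGTCAG"
signal_peptide = translate(first_line, offset=2)[:15]
```

Under these assumptions, the first 15 codons of that line translate to the signal peptide listed above.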



Domain a 

DKAA^KDVSHLTDDEVQALHGAJLHDVTASTGPLSFEDITSYK^PASCDYKGRKIACCVHG 
MPSFPFWHRAYVVQAERALLSKRKTVGMPYWDWTQTLTHLPSLVTEPI YIDSKGGKAQTNY 
WYRGEIAFINKKTAJU^VDDRLFEKVEPGHYTHLMETVLDALEQDEFCKFEIQFELAHNAIH 
YLVGGKFEYSMSNLEYTSYDPIFFLHHSNVDRLFAI-WQRLQELRGKNPNAMDCAHELAHQQ 
LQPFNRDSNPVQLTKDHSTPADLFDYKQLGYSYDSLNLNGMTPEQLKTELDERHSKERAFA 
SFRLSGFGGSAN\AATYACVPDDDPRSDDYCEKAGDFFILGGQSEMPWRFYRPFFYDVTEAV 
HHLGVPLSGHYYVKTELFSVNGTALSPDLLPQPTVAYRPGK 



Domain b 

GHLDPPVHHRHDDDLIVRKNIDHLTREEEYELRMALERFQADTSVDGYQATVEYHGLPARC 
PRPDAKVRFACCMHGMASFPHWHRLFVTQVEDALVRRGSPIGVPYWDWTKPMTHLPDLASN 
ETYVDPYGHTHHNPFFNANISFEEGHHHTSRMIDSKLFAPVAFGEHSHLFDGILYAFEQED 
FCDFEIQFELVHNSIK^WIGGSEDYSMATLHYTAFDPIFYLHHSWDRLWAIWQALQIRRK 
KPYQAHCAQSVEQLPMKPFAFPSPLNNNEKTHSHSVPTDI YDYEEVLHYS YDDLTFGGMNL 
EEIEEAIHLRQQHERVFAGFLLAGIGTSALVDIFINKPGNQPLKAGDIAILGGAKEMPWAF 
DRLYKVEITDSLKTLSLDVDGDYEVTFKIHDMHGNALDTDLIPHAAVVSEPAH 



Domain c

PTFEDEKHSLRIRKNVDSLTPEETNELRKALELLENDHTAGGFNQLGAFHGEPKWCPNPEA 
EHKVACCVHGKWFPHWHRLLALQAENALRKHGYSGALPYWDWTRPLSQLPDLVSHEQYTD 
PSDHHVKHNPWFNGHIDTVNQDTTRSVREDLYQQPEFGHFTDIAQQVLLALEQDDFCSFEV 
QYEISHNFIHALVGGTDAYGMASLRYTAYDPIFFLHHSNTDRIWAIWQSLQKYRGKPYNTA 
NCAIESMRRPLQPFGLSSAINPDRITREHAIPFDVFNYRDNLHYVYDTLEFNGLSI SQLDR 
ELEKIKSHERVFAGFLLSGIKKSALVKFEVCTPPDNCHKAGEFYLLGDENEMAWAYDRLFK 
YDITQVLEANHLHFYDHLFIRYEVFDLKGVSLGTDLFKTANWHDSGT 



Domain d 

GTRDRDNYVEEVTGASHIRKNLNDLNTGEMESLRAAFLHIQDDGTYESIAQYHGKPGKCQL 
NDHNIACCVHGMPTFPQWHRLYWQVENALLNRGSGVAVPYWEWTAPIDHLPKFIDDATYF 
NSRQQRYDPNPFFRGKVTFENAVTTRDPQAGLFNSDYMYENVLLALEQENYCDFEIQFELV 
HNALHSMLGGKGQYSMSSLDYSAFDPVFFLHHANTDRLWAIWQELQRFRELPYEEANCAIN 
LMHQPLKPFSDPHENHDNVTLKYSKPQDGFDYQNHFGYKYDNLEFKHLSI PSLDATLKQRR 
NKDRVFAGFLLHNIGTSADITI YICLPDGRRGNDCSHEAGTFYILGGETEMPFI FDRLYKF 
EITKPLQQLGVKLHGGVFELELEIKAYNGSYLDPHTFDPTI I FEPGT 



Domain e 
DTHILDKDHEEEILVRKNIIDLSPRERVSLVKALQRMKNDRSADGYQAIASFHALPPLCPN
PSAAHRYACCVHGMATFPQWHRLYTVQVQDALRRHGSLVGIPYWDWTKPVNELPEL-LSSAT 
FYHPIRNINISNPFLGADIEFEGPGVHTERHINTERLFHSGDHDGYHNWFFETVLFALEQE 
DYCDFEIQFEIAHNGIHTWIGGSAVYGMGHLHYASYDPIFYIHKSQTDRIWAIWQELQKYR 
GLSGSEANCAIEHMRTPLKPFSFGPPYNLNSHTQEYSKPEDTFDYKKFGYRYDSLELEGRS 
ISRIDELIQQRQEKDRTFAGFLLKGFGTSASVSLQVCRVDHTCKDAGYFTILGGSAEMPWA 
FDRLYKYDITKTLHDMNLRHEDTFSIDVTITS YNGTVLSGDLIQTPSI IFVPGR 

Domain f 

HKLNSRKHTPNRVRHELSSLSSRDIASLKAALTSLQHDNGTDGYQAIAAFHGVPAQCHEPS 
GREIACCIHGMATFPHWHRLYTLQLEQALRRHGSSVAVPYWDWTKPITELPHILTDGEYYD 
WQNAVLANPFARGYVKIKDAFTVRNVQESLFKMSSFGKKSLLFDQALLALEQTDYCDFEV 
QFEVMHNT I H YLVGGRQT YAFS S LE YS S YDP I FFI HHS FVDKI WAVWQELQSRRHLQFRTA 
DC^VGIjMGQAMRPFNKDFNHNSFTKKHAVPNTVFDYEDLGYNYDNLEISGLNLNEIEALIA 
KRKSHARVFAGFLLFGLGTSADIHLEICKTSENCHDAGVIFILGGSAEMHWAYNRLYKYDI 
TEALQEFDINPEDVFHADEPFFLRLSWAVNGTVIPSSHLHQPTIIYEPGE 



Domain g 

DHHDDHQSGSIAGSGVRKDVNTLTKAETDNLREALWGVMADHGPNGFQAIAAFHGKPALCP 
MPDGHNYSCCTHGMATFPHWHRLYTKQMEDAMRAHGSHVGLPYWDWTAAFTHLPTLVTDTD 
NNPFQHGKIDYLNVSTTRSPRDMLFNDPEHGSESFFYRQVXLALEQTDFCKFEVQFEITHN 
AIHSWTGGHSPYGMSTLDFTAYDPLFWLHHSNTDRIWAVWQALQEYRGLPYNHANCEIQAM 
KTPLRPFSDDINHNPVTKANAKPLDVFE YNRLSFQYDNLI FHG YS I PELDRVLEERKEEDR 
IFAAFLLSGIKRSADWFDICQPEHECVFAGTFAILGGELEMPWSFDRLFRYDITKVMKQL 
HLRHDSDFTFRVKIVGTDDHELPSDSVKAPTIEFEPG 



Domain h 

VHRGGNHEDEKHDDRLADVLIRKEVDFLSLQEANAIKDALYKLQNDDSKGGFEAIAGYHGY 
PNMCPERGTDKYPCCVHGMPVFPHWHRLHTIQMERALKNHGSPMGI PYWDWTKKMSSLPSF 
FGDSSNNNPFYKYYIRGVQHETTRDVNQRLFNQTKFGEFDYLYYLTLQVLEENSYCDFEVQ 
YEILHNAVHSWLGGTGQYSMSTLEYSAFDPVFMIHHSSLDRIWILWQKLQKIRMKPYYALD 
CAGDRLMKDPLHPFNYETVInTEDEFTRINSFPSILFDHYRFNYEYDNMRIRGQDIHELEEVI 
QELRNKDRIFAGFVXSGLRISATVKVFIHSKNDTSHEEYAGEFAVLGGEKEMPWAYERMLK 
LDISDAVHKLHVKDEDIRFRVWTAYNGDWTTRLSQPFIVHRPAHVAHDILVIPVGAGHD 
L P PKVWK S G TKVE FT P I D S S VNKAMVE L G S Y TAMAKC I VP P FS YKGFELDKVYSVDHGDY 
YIAAGTHALCEQNLRLHIHVEHE 
Figure 6

HtH2 cDNA sequence and intron structure 



Domain b 

CACAGACTGTTCGTCACCCAGGTGGAAGATGCTCTGATCAGGCGAGGATCGCCTATAGGGG 
TCCCCTACTGGGACTGGACTCAGCCTATGGCGCATCTCCCAGGACTTGCAGACAACGCCAC 
C TAT A GAG AT C C CAT C AG C G G G G AC AG C AG AC AC AAC C C C T T C C AC GAT G T T G AAG T T G C C 
T T T GAAAAT G G AC G T AC AG AAC G T C AC C C AG AT AG TAG AT T G T T T G AAC AAC C T T TAT T T G 
GCAAACATACGCGTCTCTTCGACAGTATAGTCTATGCTTTTGAGCAGGAGGACTTCTGCGA 
TTTTGAAGTTCAATTTGAGATGACCCATAATAATATTCACGCCTGGATTGGTGGCGGCGAG 
AAGTATTCCATGTCTTCTCTACACTACACAGCCTTCGACCCTATCTTCTACCTTCGTCACT 
CCAACACTGACCGGCTCTGGGCAATTTGGCAAGCGTTGCAGATACGAAGAAACAGGCCTTA 
CAAGGCTCATTGTGCTTGGTCTGAGGAACGCCAGCCTCTCAAACCTTTCGCCTTCAGTTCC 
C C AC T G AAC AAC AAC G AAAAAAC C T AC G AAAAC TCGGTGCC C AC C AAC G T T TA.C G AC T AC G 
AAGGAGTCCTTGGCTATACTTATGATGACCTCAACTTCGGGGGCATGGA.CCTGGGTCAGCT 
T GAG G AAT AC AT C C AG AG G C AG AG AC AG AGAG AC AG G AC CTTTGCTGGTTTCTTTCTGT C A 
CAT AT TGG T AC AT C AG C G AAT G T T GAAAT CAT T AT AGAC CAT G G G AC T C T T CAT AC C T C C G 
TGGGCACGTTTGCTGTTCTTGGCGGAGAGAAGGAGATGAAATGGGGATTTGACCGTTTGTA 
CAAATATGAGATTACAGATGAACTGAGGCAACTTAATCTCCGTGCTGATGATGTTTTCA.GC 
ATCTCTGTTAAAGTAACTGATGTTGATGGCAGTGAGCTGTCCTCTGAACTCATCCCATCTG 
C T G C TAT CAT C T T C G AAC G AAG C C ATA. 

Intron b/c 

GTAAGTAGCTACCTGTTTATTCAATTTTTTCGCTTTGCCAATCAATTCATTCAGCTTGAAA 
TTCAATAATTGTGTTTTGCATGGCTGAAAACCAATTTGAACTCTTTTCTTTTCTCAGGTCG 
AAC T C AAAT AAAT AAT C AC T AAT T G T TAT G C AC G C G G G TAG G G C AT AC A.T AC TAT AT C C AC 
ATCGGTCATCTCAAAATGCAAACAAATTGTCTTATTTCCGTTGGGACAAGCAAACCCCCTT 
TCCTGTAATCTTGCCTTTGGCA.TCCACTGGAATTAATGTTGACTGGTAATTGATACTGGCT 
CTCTTCTTGCATAGAGTTAATA.TCTATAGTTTGTAAATCTTTATGATTTTGCTATTTATAT 
T T C G AC AG CAT G C TAT AG AC AC C C TAG AC TAT T G T A.T A G C C AC T T G TAT TGTTTTTC CAT T 
TAT TAT T TAT AAC AG AAC AT G G C T T G T AAT T T T TAT T T AC C T T C C AG 



Domain c 

T T G AC CAT C AG G AC C C G CAT CAT G AC AC AAT CAT TAG G AAAAAT G T T GAT AAT C T TAG AC C 
CGAGGAAATTAATTCTCTGAGGCGGGCAATGGCAGACCTTCAATCAGACAAAACCGCCGGT 
GGATTCCAGCAAATTGCTGCTTTTCACGGGGAACCCAAATGGTGCCCAAGTCCCGATGCTG 
AGAAGAAGTTCTCCTGCTGTGTCCATGGAATGGCTGTCTTCCCTCACTGGCACAGACTCCT 
GACCGTGCAAGGCGAGAATGCCCTGAGAAAGCATGGATGTCTCGGAGCTCTCCCCTACTGG 
GACTGGACTCGGCCCCTGTCTCACCTACCTGATTTGGTTTTGGTAAGTAGCAGAACTACAC 
C GAT G C CAT AT T C C AC C G T G G AAG C C C G AAAC C C C T G G T AC AG C G G C C A.T AT T GAT AC AG T 
TGGTGTTGACACAACAAGAAGCGTCCGTCAAGAACTGTA.TGAAGCTCCTGGATTTGGCCAT 
TATACTGGGGTCGCTAAGCAAGTGCTTCTGGCTTTGGAGCAGGATGACTTCTGTGATTTTG 
AAGTCCAGTTTGAGATAGCTCACAATTTCATTCACGCTCTTGTCGGCGGAAGCGAGCCATA 
TGGTA.TGGCGTCACTCCGTTACACTACTTATGATCCAATTTTCTACCTCCATCATTCTAAC 
AC T G AC AG AC T C T G G G C TAT AT G G C AG G C T C T AC AAAAG T AC AG G G G C AAAC C T T AC AAT T 
CCGCCAACTGCGCCATTGCTTCTATGAG AAAAC CCCTACAACCCTTTGGTCTGACTGAT GA 
G AT C AAC C C G G AT G AT G AG AC AAG AC AG C AT G C T G T T C C T T T C AG T G T C T T T G AT T AC AAG 
AAC AAC T T C AAT TAT G AAT AT G AC AC C C T T G AC T T C AAC G G AC TAT C AAT C T C C C AG C T G G 
ACCGTGAACTGTCACGGAGAAAGTCTCATGACAGAGTATTTGCCGGATTTTTGCTGCATGG
TATTCAGCAGTCTGCACTAGTTAAATTCTTTGTCTGCAAATCAGATGATGACTGTGACCAC
TATGCTGGTGAATTCTACATCCTTGGTGATGAAGCTGAAATGCCATGGGGCTATGATCGTC 
T T T AC AAAT AT GAG AT C AC T GAG C AG C T CAAT G C C C T G GAT C T AC AC AT C G GAG AT AG AT T 
CTTCATCAGATACGAAGCGTTTGATCTTCATGGTACAAGTCTTGGAAGCAACATCTTCCCC 
AAAC C T T C T G T C AT AC AT G AC G AAG G G G C AG 

Intron c/d 

G T G AG AAC AT T G AT AAT AG T T C AAAT g AAG TAT AT C C GAT T C AAG C T G T C GAT AC AAG AT g 
AG AT A C AT AAT C AX AAT G T T T G T AT TAG AT AT C T C T C T T AAT T T AAT GCCGCTTT TAT C AA 
TAT T C GAG CAAT C C T T C AG C AAC AT AC AC C AG C AAAT G T T T CAT C AAC AG AC TAT AT TAT T 
TAATCTTTTAAAAATCCTTTTCTGTTGTTATAAATACTTAAAGTATCGAATTCCTTGAATG 
CGTCTTCTCTGCAGCATATAGTTAAGTTGTTGTGTTTCTCTGTCAG 



Domain d 

G T C AC CAT C AG G C T G AC GAG T AC G AC G AAG T T G T AAC T G C T G C AAG C C AC AT C AG AAAG AA 
TTTAAAAGATCTGTCAAAGGGAGAAGTAGAGAGCCTAAGGTCTGCCTTCCTGCAACTTCAG 
AACGACGGAGTCTATGAGAATATTGCCAAGTTCCACGGCAAGCCTGGGTTGTGTGATGATA 
ACGGTCGCAAGGTTGCCTGTTGTGTCCATGGAATGCCCACCTTCCCCCAGTGGCACAGGCT 
CTATGTCCTCCAGGTGGAGAATGCTTTGCTGGAGAGAGGATCTGCCGTCTCTGTGCCATAC 
TGGGACTGGACTGAAACATTTACAGAGCTGCCATCTTTGATTGCTGAGGCTACCTATTTCA 
ATTCCCGTCAACAAACGTTTGACCCTAATCCTTTCTTCAGAGGTAAAATCAGTTTTGAGAA 
TGCTGTTACAACACGTGATCCCCAGCCTGAGCTGTACGTTAACAGGTACTACTACCAAAAC 
GTCATGTTGGTTTTTGAACAGGACAACTACTGCGACTTCGAGATACAGTTTGAGATGGTTC 
ACAATGTTCTCCATGCTTGGCTTGGTGGAAGAGCTACTTATTCTA.TTTCTTCTCTTGATTA 
TTCTGCATTCGACCCTGTGTTTTTCCTTCACCATGCGAACACAGATAGATTGTGGGCCATC 
T G G C AG GAG C T G C AG A G G T AC AG G AAG AAG C CAT AC AAT G AAG C G GAT T G T G C C AJT T AAC C 
TAATGCGCAAACCTCTACATCCCTTCGACAACAGTGATCTCAATCATGATCCTGTAACCTT 
T AAAT AC T C AAAAC C C AC T GAT G G C T T T G AC T AC C AGAAC AAC T T T G GAT AC AAG TAT G AC 
AAC C T T GAG T T C AAT CAT T T C AG TAT T C C C AG GC T T G AAg AAAT CAT T C G t AT TAG AC AAC 
GTCAAGATCGTGTGTTTGCAGGA.TTCCTCCTTCACAACATTGGGACA.TCCGCAACTGTTGA 
GATATTCGTCTGTGTCCCTACCACCAGCGGTGAGCAAAACTGTGAAAACAAAGCCGGAACA 
TTTGCCGTACTCGGAGGAGAAACAGAGATGGCGTTTCA.TTTTGACAGACTCTACAGGTTTG 
AC AT C AG T G AAAC AC T GAG G G AC C T C G G CAT AC AG C T G G AC AG C CAT G AC T T T G AC C T C AG 
CAT C AAG AT T C AAG GAG T AAAT G GAT C C T AC C T T GAT C C AC AC AT C C T G C C AG AG C CAT C C 
TTGATTTTTGTGCCTGGTTCAAGT 

Intron d/e 

AAG AAAG T T T C AC T G T C T AAAT C T T T T T T TAT GAT AG AG G G TAG AG AAG T G GAG AC AAT G T 
G A CAAT AT AT T G AAT AAAG T T G T T TAAAAT T TAT AAC T C T CAT AAG T T CAT AT TAT G C T G A 
AG C T G TAG C CAT C TAT AAC T G T G T AAC AT G AAAT G T TAAG AC AT T AAC C T AAAT AC T T C AG 
C T GAT AAC AAAAC AAT G T T AAT AC AT AC G T CAAT G T AAC ATT T T C T TAT C T T TAG G T TATA 
G CAT AAAC AC T T C AG AG AT AC AG T G AC G AAAAC C T C TAT T T AAAT AT T T C AG G T 



Domain e 

TCTTTCCTGCGTCCTGATGGGCATTCAGATGACATCCTTGTGAGAAAAGAAGTGAACAGCC 
T G ACAAC C AG G GAG AC T G CAT C T C T GAT C CAT G C T C T GAAAAG TAT G C AG G AAGAC CAT T C 
ACCTGACGGGTTCCAAGCCATTGCCTCTTTCCATGCTCTGCCACCACTCTGCCCTTCACCA 
TCTGCAGCTCACCGTTATGCTTGCTGTGTCCACGGCATGGCTACATTTCCCCAGTGGCACA 
GAT T G T AC AC T G T AC AG T T C C AG GAT G C AC T GAG GAG AC AT G GAG C T AC G G TAG G T G T AC C 
GTATTGGGATTGGCTGCGACCGCAGTCTCACCTACCAGAGCTTGTCACCATGGAGACATAC 
CATGATATTTGGAGTAACAGAGA£TTCCCCAATCCTTTCTACCAAGCCAATATTGAGTTTG
AAG G AG AJLAAC A.T T AC AAC A G AG AG AG AAG T CAT T G C AG AC AAAC T T T T T G T C AAAG G T G G 
ACACGTTTTTGATAAACTGGTTCTTCAAACAAGCCATCCTAGCGCTGAGCAGGAAAACTAC 
TGTGACTTTGAGATTCAGTTTGAAATTCTTCACAACGGCGTTCACACGTGGGTCGGAGGCA 
GTCGTACCTACTCTATCGGACA.TCTTCATTACGCATTCTACGA.CCCTCTTTTCTACCTTCA 
C CAT T T C C AG AC AG AC C G TAT T T G g G C AAT C T G G C AAG AAC T C C AG G AAC AG AG AG G G C T C 
TCGGGTGATGAGGCTCACTGTGCTCTCGAGCAAATGAGAGAACCATTGAAGCCTTTCAGCT 
.TCGGCGCTCCTTATAACTGGAATCAGCTCACACAGGATTTCTCCCGACCCGAGGACACCTT 
CGACTACAGGAAGTTTGGTTATGAATATGACAATTTAGAATTCCTGGGAATGTCAGTTGCT 
G AAC T G GAT C AAT AC AT TAT T GAACAT C AAG AAAAT GAT AG AG TAT TCGCTGGGTTCCTGT 
TGAGTGGATTCGGAGGTTCCGCATCAGTTAATTTCCAGGTTTGTAGAGCTGATTCCACATG 
TCAGGATGCTGGGTACTTCACCGTTCTTGGTGGCAGTGCTGAGATGGCGTGGGCATTTGAC 
AG G C T T T AC AAAT AT G AC AT T AC T GAAAC T C T G GAG AAAAT G CAC C T T C GAT AT GAT GAT G 
ACTTCACAATCTCTGTCAGTCTGACCGCCAACAACGGAACTGTCCTGAGCAGCAGTCTAAT 
C C C AAC AC C GAG T G T CAT AT T C C AG C G G G G AC AT C 

Intron e/f 

AAG TAG TA-AAC T G C T C AG AT T G T T T T CAT AAT T AC T C CAC TAT T AAG T AAAAAG T AC TAG T 
AAT T C AAT AG T AC T G T T CAC AG AG AAAT G T AAC AC AAT AG AC CAC AG AG T C CAT T T G T T AA 
ACGCCTTTGGCTTGGTAAGTCTGAGGTTTTGGTGACTGATGGAAAGCTAAAATATA.TTTTG 
ACAG 



Domain f (1) 

GTGACATAAATACCAGGAGCATGTCACCGAACCGTGTTCGCCGTGAGCTGAGCGATCTGTC 
TGCGAGGGACCTGTCTAGTCTCAAGTCTGCTCTGCGAGA.ee TACAGGAGGATGATGGCCCC 
AACGGATACCAGGCTCTTGCAGCCTTCCATGGGCTACCAGCAGGCTGCCATGATAGCCGGG 
GAAAT GAG AT 

Intron f 

ATA.TTTAAAGTA.TTTTA.TCTTACGCATGACCCTGACCCTATTATTTTTTTAATCCTATGAT 
GAAACATTTACTTAGACTGGCTTGTGAGCCCCAGGCAAAATGCACTGTAAAAATACA.CTGA 
C AG AG GAT T A G G CAT T C T T G G GAG T AC T G TAT AG T TAG T T G CAT AC AT AT TAG CGTTCCCT 
CACTAAAACGAATCTCTGAATGCTATCAATTAAAGATCATGATGCTTTGATTGTGTCTACT 
G TAT T T AAAAT G G T G T T AAG AT T T G C AAT T AC AAT A.T AC AC AAAC AC G T T T C C T G C AT C T C 
GGAGAATGCAATCTTTCGTTGTACGCGTCTGTTTTCA.TATTTTTATGCA.TGTA.GTTTGCAC 
T AC T TAG C G T C C AAT AAAT C CAT T CAC AAAAT CAC AC AAAC AAAC GAT T T TAG G AAT G T G A 
CTGTAGCTGCAACGAATATACCTGATCCTTTCTTGTTCCAGAT 

Domain f (2) 

CGCATGTTGCATTCACGGGATGCCGACCTTCCCCCAGTGGCACAGACTGTACACCCTGCAG 
TTGGAGATGGCTCTGAGGAGACATGGATCATCTGTCGCCA.TCCCCTACTGGGACTGGACAA 
AGCCTATCTCCGAACTCCCCTCGCTCTTCACCAGCCCTGAGTA.TTATGACCCATGGCATGA 
TGCTGTGGTAAACAACCCATTCTCCAAAGGTTTTGTCAAATTTGCAAATACCTACA.CAGTA 
AGAGACCCACAGGAGA.TGCTGTTCCAGCTTTGTGAACATGGAGAGTCAATCCTCTA.TGAGC 
AAACTCTTCTTGCT.CTTGAGCAAACCGACTACTGTGATTTTGAGGTACAGTTTGAGGTCCT 
CCATAACGTGA.TCCACTACCTTGTTGGTGGACGTCAGACCTACGCATTGTCTTCTCTGCA.T 
TATGCCTCCTACGACCCATTCTTCTTTATACACCATTCCTTTGTGGATAAGATGTGGGTAG 
TAT G G C AAG C T C T T C AAAAG AG GAG GAAAC T T C CAT AC AAG C GAG C T G AC TGTgcTGT C AA 
C C T AAT G AC T AAAC C AAT GAG G C C AT T T G AC T C C GAT AT G AAT C AG AAC C CAT T CAC AAAG 
AT G CAC G C AG T T C C C AAC AC AC T C TAT G AC T AC GAG AC AC T G T AC T A C AG C T AC GAT AAT C 
T C GAAAT AG G T G G C AG G AAT C T C G AC C AG C T T C AG G C T GAAAT T G AC AG AAG C AG AAG C C A 
CGATCGCGTTTTTGCTGGATTCTTGGTTCGTGGAATCGGAACTTCTGCTGATGTCAGGTTT
TGGATTTGTAGAAATGAAAATGACTGCCACAGGGGTGGAATAATTTTCATCTTAGGTGGAG 
C ChAG G AAAT G C CAT G G T CAT T T G AC AG AAAC T T C AAG T T T GAT AT C AC C CAT G T AC T C G A 
GAATGCTGGCATTAGCCCAGAGGACGTGTTTGATGCTGAGGAGCCATTTTATATCAAGGTT 
GAGATCCATGCTGTTAACAAGACCATGATACCGTCGTCTGTGATCCCAGCCCCAACTATCA 
TCTATTCTCCTGGGGAAG 

Intron f/g 

GT G AG AGAAC C AG T AAT AG C T AC T G T C T AC AAAGAAT G T G T T CAT T T AAAG AC C T G AC T G T 
AGGCCGATGGCTGCTGTCATCTCCTCCGCCTCCTCCTCCTGTTCCTCCTCCGAAGGGGTCA 
GCTTCAGGTTCTCTTGCCAATATGCCAAGCAGACCTCCTGAGCAGGCAGTATATATACGTA 
AG G GAAG C AAG TAT G G AC CAT CGCGCGG CAT G TAG AG AT AC AAT GAT C AG CTGTCTGCTGT 
T C C AC T C C T G T C AG AC AAT GAG AT AAAC AT G AAT AC AG TAT T AC T C AG C AG C G T T C C AAT T 
TTCAACCCTCGTATTTATTAAAAAAAGGAATTTTTAATATATTTTTCTCCTTGTTGAAATA 
TTTTAGTAACTGTTAATCGATATAGAGTGGAGTAGTGACGCTTTA.TTTCGGtTCATTCTCG 
AAACAAAAATA.TAATAGTCCACTGAACTCTCTTAAATTGTTTTTACAACCTTCAACTGCCA 
CAGACGTAATCCCTCACGTTATTTTGAGCTGACAACGTGTTGAATTGAGTGTGTTCCGAAT 
T C T AAAT AAG CAT G TAT AT AT T T AC G T C T CAT G C AAG T AAT AT AT G T T T AAC T GAT G AC G T 
CACTTGGTGACCACTGATTTAGTTCCTTTGTCATAATTGCAGTTTCTGTTGTCACGGGGAC 
GGTGGGGAAGCCAGGTTCCTCCTGTCACGCTGAATATCCCGTTCGAATCCCCCACATGGGT 
ACAAAGTGTGATGCCTATTTCTGGTGTCCCCCACCGTGATATTGCTGGAATAAGTGGCTTA 
AT AC CAT AT AC AC T C AC T C TAT T G T C AC AC T AC T G C C AC C G G C T C AC AC C T C T GAT G C T T C 
TG T T C TAT C C AG 



Domain g(1)

GTCGCGCTGCTGACAGTGCGCACTCTGCCAACATTGCTGGCTCTGGGGTGAGGAAGGACGT 
CACGACCCTCACTGTGTCTGAGACCGAGAACCTAAGACAGGCTCTTCAAGGTGTCATCGAT 
GA-TACTGGTCCCAATGGTTACCAAGCAATAGCATCCTTCCACGGAAGTCCTCCAATGTGCG 
AGATGAACGGCCGCAAGGTTGCCTGTTGTGCTCACG 

Intron g(1)

G T AAT TAAT G GAT G T GAAG T C AAT G T C C GAG G G TAT AAT AAG GAT T TAAAT AC T T C AG T C G 
TGTAATACTGTATGACATGTGTATTGGATGGTGTAGGTATTACAGGTTATAAGGCCAGTGT 
GTGTTGGGACGGTTACTTTCCTGCACTAGTAATAAGCATTGTATTTAGCTAGCTTTTATCA 
TA*TAACTTTAGTTTCAGGTTTGtGGCAATTGAAATCGAAATTTTCTTTCATTTCAAGGTTA 
T C G C AC T C G T G T G TNAG AAT AG T T AC TAT G C T G C ATT G AGAAT AAC AC TAT AG TAAT AAAG 
CAT AT CAT AC AG T AAG AAT AAC AC TAT AG TAAT AAAG T A.T AT C ATN C AG T AAG AAT G T CAT 
T G TAT GAT AAAT AG G T TAT C AC AC TCGTGTGTTT TAG AAT G G T T AC TAT C C C AG G AAT AAC 
C AC TAT G TAT T AC AT G TAT AT T G G G C AG T G T AAG TAG TAG C A.T T G TAT AT TAAAT C AG TAT 
AT C G T G C T T C AAAAC AC C AG GAT AT AT G G G G TAT AC AG T G G G C AG T G T AAG TAG C AAC AT T 
G TAT AT TAAAT C AG TAT AT C G T AC T T C AAAAC AC C AG G A.T T A.T G G G G TAT AC AG T G G G C AG 
T G T AAG TAG TAG CAT T G T AT A.T TAAAT C AG TAT AT CG T AC T T C AAAAC AC C AG GAT AT AAT 
T C AG TAT AT C G T G C T T C AAAAC AC C AG GAT AT AAT T C AG TAT AT C G T G C T T C AAAAC AC C A 
G GAT AT AT G G GAT AT AC AG TGCGGGTTTG CAT AC AAC C T C C AC C C T T T AC AG 



Domain g (2)

GTATGGcCTCCTTCCcACACTGGCACAGACTGTATGTGAAGCAGATGGAAGATGCCCTGGC 
TGACCACGGGTCACATATCGGCATCCCTTACTGGGACTGGACAACTGCCTTCACAGAGTTA 
CCCGCCCTTGTCACAGACTCCGAGAACAATCCCTTCCATGAG 

Intron g(2)

GTCAGTTTAGTCTCCTGTCTGAGCTAACGATACCAATTTCCTATTTTCGAGAACCACGATG 
AC G AG AAAAC AAG CAAT AT AG AT AT AG AT G C AG T AT AGAT C AAG T TAAT G AAT T CAT T G C T 
AT AT GTTTGCTTG T AAT AAAC T T T AAG AAAAC G AG A.G CAT G C AC AC AAAT G AAAC AAAC AA 
TTATGTGTTTGATAGGAATATGATATATGTATTTGGGGGCTGACGTGAGCAGGGTTGAAGG 
G AC AG T T T AC AT T G T C AG T AAC AC T G G GAG TAT T C T T T GAT C C AC AAT AT AT AG T T T CAT T 
G T G T T C AG C AG T T AC AAC T AAC AT TAT AT CAT AC AT T AC G T C G t AAC AT GCTTCTTTTGTC 
CTCTTTTGCCAG 



Domain g(3) 

GGTCGCA.TTGATCATCTCGGTGTAACCACGTCACGTTCCCCCAGAGACATGCTGTTTAACG 
ACCCAGAGCAAGGATCAGAGTCGTTCTTCTATAGACAAGTCCTCCTGGCTTTGGAGCAGAC 
TGACTACTGCCAGTTCGAAGTCCAGTTTGAGCTGACCCACAACGCCATTCACTCCTGGACA 
GGTGGACGTAGCCCTTACGGAATGTCGACCCTCGAGTTCACAGCCTACGATCCTCTCTTCT 
GGCTTCACCACTCCAACACCGACAGAATCTGGGCTGTCTGGCAAGCACTGCAGAAATACCG 
AGGACTCCCATACAACGAAGCACACTGTGAAATCCAGGTTCTGAAACAGCCCTTGAGGCCA 
T T C AAC GAT G AC AT C AAC C AC AAT C CAAT C AC C AAG AC TAAT G C C AG G C C TAT C GAT T CAT 
T T GAT TAT GAG AG G t T T AAC TAT C AG TAT G AC AC C C T TAG C T T C CAT G G T AAG AG CAT C C C 
T G AAC T G AAT G AC C T G C T C GAG G AAAG AAAAAG A.G AAG AG AG AAC AT TTGCTGcCTTCCTT 
CTTCGTGGAATCGGTTGCAGTGCTGATGTCGTCTTTGACATCTGCCGgCCCAATGGTGACT 
GTGTCTTTGCAGGAACCTTTGCTGTGCTGGGAGGGGAGCTaGAAATGCCTTGGTCCTTCGA 
C AG AC TGTTCCGC TAT G AC AT C AC C AG AG T CAT G AAT C AG C T C CAT C T C C AG TAT GAT T C A 
GATTTCAGTTTCAGGGTGAAGCTTGTTGCCACCAATGGCACTGAGCTTTCATCAGACCTtC 
T C AAG T C AC C AAC AAT T G AAC AT G AAC T T G G 

Intron g/h 

G TAT G T TAT C T TAT CAT C AAAT G T G T GAT C AG AT AC T G GAG AC G T T T T CAT AT T AA C T T G G 
T C AG CAT TAG T T GAT GAT TTTGGTGC GAT G T T G AC G AC AAG GAG T C AAG CAT T AAC AC AT T 
C AAC AC AT C T T TAAT C T GAT AT GAG AAG G G AAT AAAT T GAT C C A>G TAT T GAT GAT T G AAG T 
TAG AT T AAC AG T GAAAGAT AT AC C AG T T T T GAT AAT C G TAT AAAAC AG TAG C AG AAT T G T A 
TCGTGAAAACTAAATGTGGGAAGGCGAACGCCAAGCAGATTTTAGATTACGATCGTGTGCT 
AG AAT AAT T C AC AAT AAC C C AG AC G T C G G AAAT GTGGTTGTC TAT G G CAAT G G T T AC GAT T 
AAT T G C T AAC AT G C AC GAT T TAX C TAT T T C AG 



Domain h 

AG C C C AC AG AG G AC C AG T T G AAG AAAC AG AAG T C AC T C G C C AAC AT AC T G AC G G CAAT G C A 
CACTTTCATCGTAAGGAAGTTGATTCGCTGTCCCTGGATGAAGCAAACAACTTGAAGAATG 
C C C T T T AC AAG C T AC AG AAC G AC C AC AG T C T AAC G G GAT AC G AAG C AJlT C T C T G G T T AC C A* 
TGGATACCCCAATCTGTGTCCGGAAGAAGGCGATGACAAAATACCCCTGCTGCGTCCCCGG 
ATGGGCATCTTTCCTTACTGGCACAGACTCTTGACCATTCAACTGGAAAGAGCTCTTGAGC 
ACAATGGTGCACTGCTTGGTGTTCCTTACTGGGACTGGAACAAGGACCTGTCGTCACTGCC 
GGCGTTCTTCTCCGACTCCAGCAACAACAATCCCTACTTCAAGTACCACATCGCCGGTGTT 
G G T C AC G AC AC C G T C AG AG AG C C AAC TAG T C T TATA TAT AAC C AG C C C C AAAT C CAT G G T T 
AT GAT T AT CT C TAT T AC C TAG CAT T G AC C AC G C T T G AAG AAAAC AAT T AC T GG G AC T T T G A 
GGTTCA.GTATGAGATCCTCCACAACGCCGTCCACTCCTGGCTTGGAGGATCCCAGAAGTAT 
TCCATGTCTACCCTGGAG TATTCGGCCTTTGACCCTGTCTTTATGATCCTTCACTCGGGTC 
TAG AC AG AC T T T G GAT CAT C T G G C AAG AAC T T C AG AAG AT C AG GAG AAAG C C C T AC AAC T T 
CGCTAAATGTGCTTATCATATGATGGAAGAGCCACTGGCGCCCTTCAGCTATCCATCTATC 
AACCAGGACGAGTTCACCCGTGCCAACTCCAAGCCTTCTACAGTTTTTGACAGCCATAAGT 
T C G GC T AC CAT T AC G AT AAC C T G AAT G TT AGAjG G T C AC AG CAT C C AAGAAC T C AAC AC AAT
CAT C AAT G AC T T G AG AAAC AC AG AC AG AAT C T AC G C AG GAT TTGTTTTGT C AG G CAT C G G T 
AC G T C T G C TAG T G T C AAG AT C TAT C T C C G AAC AG AT G AC AAT G AC G AAG AAG T T G G AAC T T 
TCACTGTCCTGGGAGGAGAGAGGGAAATGCCATGGGCCTACGAGCGAGTTTTCAAGTATGA 
CAT C AC AG AG G T T G C AG AT AG AC T T AAAAT T AAG T TAT G G G G AC AC C C T T T AAC T T C C G G A 
ACTGGAGATCACATCCTTACGAATGGAATCGGTGGTAAACAAGAGCCTACCCAAATCCTTT 
CAT CAT C T AC AG AC C T G C C AAT CAT G AC T AC GAT GTTCTTGT TAT C C C AG TAN G G AAG AAA 
CCTTCACATCCCTCCCAAAGTTGTCGTCAAGAAAGGCACCCGCATCGAGTTCCACCCAGTC 
GATGATTCAGTTACGAGACCAGTTGTTGATCTTGGAAGCTACACTGCACTCTTCAACTGTG 
TGGTACCACCGTTCACATACCACGGATTCGAACTGAACCACGTCTATTCTGTCAAGCCTGG 
T G AC T AC TAT G T T AC T G G AC C C AC GAG AG AC C T T T G C C AG AAT G C AG AT G T C AG GAT T CAT 
AT C CAT G T T GAG GAT GAG TAA 



3' UTR

CGCAACAGGT

Intron UTR

GAG AT AAG AAAC C C T T C T AAC AG T AAT AC G AC AC C AC AT T AC AG C T T AAAC AT GAT T G C C A 
TCGATGTTTTCATGTGTAGTATACGCTTTTCAGTTCTACATAATTTTGTTTTTCAAATCAA 
GTTTAGCAAATGAATCTATCACTGGAAAATAGGGTAGGGTAGCCAAGTGGTTAAAGCGGTC 
ACTGATCACGCCAAAGACGAGTGTCCTAACCTGCATGGGTACAAAAGTGAAGACCATTGCT 
GGTGTCTACCGCCGTAATATTGTTTTTAGTATTGCTAAAACTTATACTCACCCATGCGCTG 
T AAAAG T G GAAT AAT AAT CAT AT T T C AAC AAAAGC AC AAAAC CAT T T CAT T T T CAT GAAAG 
CCTCTTGTT C AC C T G AAAG AC G C AAG AG AAC AAT AG T T C C T AAC AT TAT T T T C AG AC AT T G 
GAAATGTCCTGCACGTGTAAACCATATATCCTTTGAAATTTTTACGACTGCATCGTATACA 
AT T TAT GAT AT AAAT T T AAAAC T T TAT 



3' UTR

TTCTTGGTCTCCACATA.TTCACATATCAGCACCAAATGGTTTCGAAGGACATTGGCGTTCT 
TCTCTGGCAATGCATTTCAATACAACATTGAAAATGACTTCAGCATATCAGTGTGCTTCGA 
ACGTGTTCCGGAAGTACTCAAATGTGCTATGACTGAATTATTGTACATACATAACTTATTG 
AT G T T C AAT AAAT AAAT G T T G AAAC G AAAAAAAAAAAAAAAAAAAAAAA 

Figure 7

Derived primary structure of HtH2 



Domain b 

HRLFVTQVEDALIRRGSPIGVPYWDWTQPMAHLPGLADNATYRDPISGDSRHNPFHDVEVA
FENGRTERHPDSRLFEQPLFGKHTRLFDSIVYAFEQEDFCDFEVQFEMTHNNIHAWIGGGE 
KYSMSSLHYTAFDPIFYLRHSNTDRLWAIWQALQIRRNRPYKAHCAWSEERQPLKPFAFSS 
PLNNNEKTYENSVPTNVYDYEGVLGYTYDDLNFGGMDLGQLEEYIQRQRQRDRTFAGFFLS 
KIGTSANVEIIIDHGTLHTSVGTFAVLGGEKEMKWGFDRLYKYEITDELRQLNLRADDVFS
ISVKVTDVDGSELSSELIPSAAIIFERSH



Domain c 

IDHQDPHHDTIIRKWDNLTPEEIKSLRRAMADLQSDKTAGGFQQIAAFHGEPKWCPSPDA 
EKKFSCCVHGMAVFPHWHRLLTVQGENALRKHGCLGALPYWDWTRPLSHLPDLVLVSSRTT 
PMPYSTVEARNPWYSGHIDTVGVDTTRSVRQELYEAPGFGHYTGVAKQVLLALEQDDFCDF 
EVQFEIAHNFIHALVGGSEPYGMASLRYTTYDPIFYLHHSNTDRLWAIWQALQKYRGKPYN 
SANCAIASMRKPLQPFGLTDEINPDDETRQHAVPFSVFDYKNNFNYEYDTLDFNGLSISQL
DRELSRRKSKDRVFAGFLLHGIQQSALVKFFVCKSDDDCDHYAGEFYILGDEAEMPWGYDR 
LYKYEITEQLNALDLHIGDRFFIRYEAFDLHGTSLGSNIFPKPSVIHDEGA 



Domain d 

GHHQADEYDEWTAASHIRKNLKDLSKGEVESLRSAFLQLQNDGVYENIAKFKGKPGLCDD 
NGRKVACCVHGMPTFPQWHRLYVLQVENALLERGSAVSVPYWDWTETFTELPSLIAEATYF 
NSRQQTFDPNPFFRGKISFENAVTTRDPQPELYVNRYYYQNVMLVFEQDNYCDFEIQFEl^V
HNVLHAWLGGRATYSISSLDYSAFDPVFFLHHANTDRLWAIWQELQRYRKKPYNEADCAIN 
LMRKPLHPFDNSDLNKDPVTFKYSKPTDGFDYQNNFGYKYDNLEFNHFSIPRLEEIIRIRQ
RQDRVFAGFLLKNIGTSATVEIFVCVPTTSGEQNCENKAGTFAVLGGETEMAFKtDRLYRF
DISETLRDLGIQLDSHDFDLSIKIQGVNGSYLDPHILPEPSLIFVPGSS 



Domain e 

SFLRPDGHSDDILVRKEVNSLTTRETASLIHALKSMQEDHSPDGFQAIASFHALPPLCPSP 
SAAHRYACCVHGMATFPQWHRLYTVQFQDALRRHGATVGVPYWDWLRPQSHLPELVTMETY 
HDIWSNRDFPNPFYQANIEFEGENITTEREVIADKLFVKGGHVFDKLVLQTSHPSAEQENY 
CDFEIQFEILHNGVHTWVGGSRTYSIGHLHYAFYDPLFYLHHFQTDRIWAIWQELQEQRGL
SGDEAHCALEQMREPLKPFSFGAPYNWNQLTQDFSRPEDTFDYRKFGYEYDNLEFLGMSVA 
ELDQYIIEHQENDRVFAGFLLSGFGGSASVNFQVCRADSTCQDAGYFTVLGGSAEMAWAFD
RLYKYDITETLEKMHLRYDDDFTISVSLTANNGTVLSSSLIPTPSVIFQRGH 



Domain f 

RDINTRSMSPNRVRRELSDLSARDLSSLKSALRDLQEDDGPNGYQALAAFHGLPAGCHDSR
GNEIACCIHGMPTFPQWHRLYTLQLEMALRRHGSSVAIPYWDWTKPISELPSLFTSPEYYD 
PWHDAVVNNPFSKGFVKFANTYTVRDPQEMLFQLCEHGESILYEQTLLALEQTDYCDFEVQ
FEVLHNVIHYLVGGRQTYALSSLHYASYDPFFFIHHSFVDKMWVVWQALQKRRKLPYKRAD
CAVNLMTKPMRPFDSDMNQNPFTKMHAVPNTLYDYETLYYSYDNLEIGGRNLDQLQAEIDR
SRSHDRVFAGFLLRGIGTSADVRFWICRNENDCHRGGIIFILGGAKEMPWSFDRNFKFDIT
HVLENAGISPEDVFDAEEPFYIKVEIHAVNKTMIPSSVIPAPTIIYSPGE



Domain g 



GRAADSAHSANIAGSGVRKDVTTLTVSETENLRQALQGVIDDTGPNGYQAIASFHGSPPMC 
EMNGRKVACCAHGMASFPHWHRLYVKQMEDALADHGSHIGIPYWDWTTAFTELPALVTDSE
NNPFHEGRIDHLGVTTSRSPRDMLFNDPEQGSESFFYRQVLLALEQTDYCQFEVQFELTHN 
AIHSWTGGRSPYGMSTLEFTAYDPLFWLHHSNTDRIWAVWQALQKYRGLPYNEAHCEIQVL 
KQPLRPFNDDINHNPITKTNARPIDSFDYERFNYQYDTLSFHGKSIPELNDLLEERKREER 
TFAAFLLRGIGCSADVVFDICRPNGDCVFAGTFAVLGGELEMPWSFDRLFRYDITRVMNQL
HLQYDSDFSFRVKLVATNGTELSSDLLKSPTIEHEL 



Domain h 



GAHRGPVEETEVTRQHTDGNAHFHRKEVDSLSLDEANNLKNALYKLQNDHSLTGYEAISGY 
HGYPNLCPEEGDDKIPLLRPRMGIFPYWHRLLTIQLERALEHNGALLGVPYWDWNKDLSSL 
PAFFSDSSNNNPYFKYHIAGVGHDTVREPTSLIYNQPQIHGYDYLYYLALTTLEENNYWDF
EVQYEILHNAVHSWLGGSQKYSMSTLEYSAFDPVFMILHSGLDRLWIIWQELQKIRRKPYN
FAKCAYHMMEEPLAPFSYPSINQDEFTRANSKPSTVFDSHKFGYHYDNLNVRGHSIQELNT
IINDLRNTDRIYAGFVLSGIGTSASVKIYLRTDDNDEEVGTFTVLGGEREMPWAYERVFKY
DITEVADRLKIKLWGHPLTSGTGDHILTNGIGGKQEPTQILSSSTDLPIMTTMFLLSQXGR
NLHIPPKVVVKKGTRIEFHPVDDSVTRPVVDLGSYTALFNCVVPPFTYHGFELNHVYSVKP
GDYYVTGPTRDLCQNADVRIHIHVEDE 

Figure 8

KLH1 cDNA sequence and intron structure 



Domain b 

GGCCTACCGTACTGGGACTGGACTGAACCCATGACACACATTCCGGGTCTGGCAGGAAACA 
AAACTTATGTGGATTCTCATG.GTGCATCCCACACAAATCCTTTTCATAGTTCAGTGATTGC 
AT T T G AAG AAAAT G C T C C C C AC AC C AAAAG AC AAAT AG AT C AAAG AC T C T T T AAAC C C G C T 
AC C T T T G G AC AC C A.C AC AG AC C T G T T C AAC C AG AT T T T G T A-T G C C T T T G AACAAG AA G AT T 
ACTGTGACTTT G AAG T C CAAT T T GAG AT T AC C CAT AAC AC GAT T C AC G C T T G G AC AG GAG G 
AAGCGAACATTTCTCAATGTCGTCCCTACATTACACAGCTTTCGATCCTTTGTTTTACTTT 
CACCATTCTAACGTTGATCGTCTTTGGGCCGTTTGGCAAGCCTTACAGATGAGACGGCATA 
AACCCTACAGGGCCCACTGCGCCATATCTCTGGAACATA.TGCATCTGAAACCATTCGCCTT 
T T CAT C T C C C C T T AAC AAT AAC G AAAAG AC T CAT G C CAAT G C CAT G C C AAAC AAG AT C T AC 
G AC TAT G AAAAT G T C C T C CAT T AC AGAT AC G AAG AT T T AAC AT T T G GAG G CAT CTCTCTGG 
AAAAC AT AG AAAAG AT GAT C C AC GAAAAC C AG C AAG AAG AC AG AAT AT AT GCCGGTTTTCT 
CCTGGCTGGCATACGTACTTCAGCAAATGTTGATATCTTCATTAAAACTACCGATTCCGTG 
CAACATAAGGCTGGAACATTTGCAGTGCTCGGTGGAAGCAAGGAAATGAAGTGGGGATTTG 
ATCGCGTTTTCAAGTTTGACATCACGCACGTTTTGAAAGATCTCGATCTCACTGCTGATGG 
CGATTTCGAAGTTACTGTTGACATCACTGAAGTCGATGGAACTAAACTTGCATCCAGTCTT 
ATTCCACATGCTTCTGTCATTCGTGAGCATGCACGTGGTAAGCTGAATAGAG 

Intron b/c 

GTTTTGTAATAATTATGTAGAATTCTTTACCTCAGAATAAGATGAGGTCACATGGGTTTTG 
C AAAAC TAT T AC G T T C G AAT T AAT AT T AAT AAT AC C G G AC C C T C C AC T G G T AC AT AT T TAT 
C T T TAT AAC GAT AAT AG C GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT g AT AAT g 
AT gAT GC CG G TA.TT G CACGTAATCCAGCCGAcTT AG AT G AC AC CCT AAG GGTGC AG AAAGT 
ATAaCAATTAGATTGCGTTtGCATCTGTGTATGCGTGTGCTTTAaCCAAAAGTCAAAATAA 
AAG T GCAAAC C C T TAG T TT AT T CAT T T G ATAG AG C C T T T T AC G AT A.AGAAC AAT G T AATAA 
AT TAG AAC AT AAC T G AAAC C T C C G AAAG AAG GCCTGTTTGT C AAG AG AG G T A*T C G AC A_T G A 
TTGACTTATAAACCTGTGCTTCTATATTTTGGAACTGTCCACTTTCTTGTTGTGTGTACTG 
T AAT C AC AT C G C AC TAT G G C T G C AAG AC G T G T AC GAG T AC AC TAT AT AC T T AC C T AAT G AC 
C AAC C AC AAG GCTGGCTTTGT T AAT AT T G T TAT T T C AC AG AAAT AAAC AC AG AAT T C C AG C 
ATTTGGCTGGTGTATTTAGCAAAACACCGATATGACACTCA.TGTTTTATTACATTTTTTTC 
AG 

Domain c 

TTAAATTTGACAAAGTGCCAAGGAGTCGTCTTATTCGAAAAAATGTAGACCGTTTGAGCCC 
CGAGGAGATGAATGAACTTCGTAAAGCCCTAGCCTTACTGAAAGAGGACAAAAGTGCCGGT 
GGATTTCAGCAGCTTGGTGCATTCCATGGGGAGCCAAAATGGTGTCCTAGTCCCGAAGCAT 
CTAAAAAATTTGCCTGCTGTGTTCACGGCATGTCTGTGTTCCCTCACTGGCATCGACTGTT 
GACGGTTCAGAGTGAAAATGCTTTGAGACGACATGGCTACGATGGAGCTTTGCCGTACTGG 
GATTGGACCTCTCCTCTTAATCACCTTCCCGAACTGGCAGATCATGAGAAGTACGTCGACC 
C T G AAG AT G G G G TAG AG AAG CAT AAC CCTTGGTTC GAT G G T CAT AT AG AT AC AG T C G AC AA 
AAC AAC AAC AAG AAG T G T T C AG AAT AAAC T C T T C G AAC AG C C T GAG T T T G G T CAT TAT AC A 
AGCATTGCCAAACAAGTACTGCTAGCGTTGGAACAGGACAATTTCTGTGACTTTGAAATCC 
AAT AT GAG AT T G C C CAT AAC T AC AT C CAT G C AC T T G TAG GAG G C G C T C AG C C T T A-T G G TAT 
GGCATCGCTTCGCTACACTGCTTTTGATCCACTATTCTACTTGCATCACTCTAATACAGAT 
C G TAT AT G G G C AAT A.T G G C AG G C T T T AC AG AAG T AC AG AG GAAAAC C G T A C AAC G T T G C T A 
ACTGTGCTGTTACATCGA.TGAGAGAACCTTTGCAACCATTTGGCCTCTCTGCCAATATCAA 
C AC AG AC CAT G T AAC C AAG GAG CAT T C AG T G C CAT T C AAC G T T T T T GAT T AC AAGAC CAAT 
TTCAATTATGAATATGACACTTTGGAATTTAACGGTCTCTCAATCTCTCAGTTGAATAAAA
AGCTCGAAGCGATAAAGAGCCAAGACAGGTTCTTTGCAGGCTTCCTGTTATCTGGTTTCAA 
GAAATCATCTCTTGTTAAATTCAATATTTGCACCGATAGCAGCAACTGTCACCCCGCTGGA 
GAG T T T T AC CTTCTGGGT GAT G AAAAC GAG AT G C CAT G G G CAT AC GAT AG AG T C T T C AAAT 
AT G AC AT AAC C G AAAAAC T C C AC GAT C T AAAG C T G CAT G C AG AAG AC C AC T T C T AC AT T G A 
CTATGAAGTATTTGACCTTAAACCAGCAAGCCTGGGAAAAGATTTGTTCAAGCAGCCTTCA 
G T C AT T C AT G AAC C AAG AAT AG 

Intron c/d 

G T AC T T G T TAT AT G T T T C G AAT AT T G C C GAT AC C T T C AAT AT AT AT AC T T TAT C AAAG T AA 
T T GAT T AAT C T G AAG T AAT TTTCCTTTC C AG TAG AG AT T C A G T T GAT AC AAC AAG AAT T C G 
CCCTGTTGTATGTCACTTTATTTTCATCAAACGATTCGAAGTGAGCTGTCCATGCCACAAT 
GGGGTCTCTGTAACTTTCTCGTATGGGGTATAGATTATATAGACGTGGCAGACCTTACGTA 
TAACTAATATTTGTGTAATGTCGTTTCAG 



Domain d 

GTCACCATGAAGGCGAAGTATATCAAGCTGAAGTAACTTCTGCCAACCGTATTCGAAAAAA 
CATTGAAAATCTGAGCCTTGGTGAACTCGAAAGTCTGAGAGCTGCCTTCCTGGAAATTGAA 
AACGATGGAACTTACGAATCAATAGCTAAATTCCATGGTAGCCCTGGTTTGTGCCAGTTAA 
ATGGTAACCCCATCTCTTGTTGTGTCCATGGCATGCCAACTTTCCCTCACTGGCACAGACT 
GTACGTGGTTGTCGTTGAGAATGCCCTCCTGAAAAAAGGATCATCTGTAQCTGTTCCCTAT 
T G G G AC T G G AC AAAAC G AAT C G AACAT T T AC C T C AC C T GAT T T C AG AC G C C AC T T AC T AC A 
AT T C C AG G C AAC AT C AC TAT G AG AC AAAC C CAT T C CAT CAT G G C AAAAT C AC AC AC G AGAA 
T G AAAT C AC T AC TAG G GAT C C C AAG G AC A.G C C T C T T C CAT T C AG AC T AC T T T T AC GAG C AG 
GTCCTTTACGCCTTGGAGCAGGATAACTTCTGTGATTTCGAGATTCAGTTGGAGATATTAC 
ACAATGCATTGCATTCTTTACTTGGTGGCAAAGGTAAATATTCCATGTCAAACCTTGATTA 
CGCTGCTTTTGATCCTGTGTTCTTCCTTCATCACGCAACGACTGACAGAATCTGGGCAATC 
TGGCAAGACCTTCAGAGGTTCCGAAAACGGCCATACCGAGAAGCGAATTGCGCTATCCAAT 
T GAT G C AC AC G C C AC T C C AG C C G T T T G AT AAGAG C G AC AAC AAT GAC GAG G C AAC G AAAAC 
GCATGCCACTCCACATGATGGTTTTGAATATCAAAACAGCTTTGGTTATGCTTACGATAAT 
C T G G AAC T G AAT C AC T AC T C GAT T C C T C AG C T T GAT C AC AT G C T G C AAG AAAG AAAAAG G C 
ATGACAGAGTATTCGCTGGCTTCCTCcTTCACAATATTGGAACatCTGCCGATGGCCATGT 
ATTTGTATGTCTCCCAACTGGGGAACACACGAAGGACTGCAGTCATGAGGCTGGTATGTTC 
TCCATCTTAGGCGGTCAAACGGAgATGTCCTTTGTATTTGACAGACTTTACAAACTTGACA 
TAACTAAAGCCTTGAAAAAGAACGGTGTGCACCTGCAAGGGGATTTCGATCTGGAAATTGA 
GAT T AC G G C T G T G AAT G GAT C T CAT C TAG AC AG T CAT G T CAT C C AC T C T C C C AC TAT AC T G 
TTTGAGGCCGGAACAG 



Intron d/e 

G T AAC TAT T T T G T C AC T G T AAC C AAC AAC T G C AG T C TAT T T T G C AAT T AC GAT AAT AAC AA 
T T T T T G AAAT AT AT C T T TAT T AAAG C AAAG G T T T C TAG AG AC AAAC AG C C G G C T C T AAT T A 
TTTTTTCGAACTTACGCTTGAGTAAAGATCTGCAAATGGCAACCCTACCTATACTATTAAA 
AAT ATAAT G T T AC AT T C G TAT C T G AAT G T T TAATAAAT C AC T T CAT AT T C T G T T G C AG 

Domain e 

AT T C T G C C C AC AC AG AT GAT G GAC AC AC T G AAC C AG T GAT GAT T C G C AAAG AT AT C AC AC A 
ATTGGACAAGCGTCAACAACTGTCACTGGTGAAAGCCCTCGAGTCCATGAAAGCCGACCAT 
TCATCTGATGGGTTCCAGGCAATCGCTTCCTTCCATGCTCTTCCTCCTCTTTGTCCATCAC 
CAGCTGCTTCAAAGAGGTTTGCGTGCTGCGTCCATGGCATGCCAACCTTCCCGCAATG 

Figure 9



Derived primary structure of KLH1 



Domain b 

GLPYWDWTEPMTHIPGLAGNKTYVDSHGASHTNPFHSSVIAFEENAPHTKRQIDQRLFKPA 
TFGHHTDLFNQILYAFEQEDYCDFEVQFEITHNTIHAWTGGSEHFSMSSLHYTAFDPLFyF 
HHSNVDRLWAVWQALQMRRHKPYRAHCAISLEHMHLKPFAFSSPLNNNEKTHANAMPNKIY
DYENVLHYTYEDLTFGGISLENIEKMIHENQQEDRIYAGFLLAGIRTSANVDIFIKTTDSV 
QHKAGTFAVLGGSKEMKWGFDRVFKFDITHVLKDLDLTADGDFEVTVDITEVDGTKLASSL
IPHASVIREHARGKLNR



Domain c 

VKFDKVPRSRLIRKNVDRLSPEEMNELRKALALLKEDKSAGGFQQLGAFHGEPKWCPSPEA
SKKFACCVHGMSVFPHWHRLLTVQSENALRRHGYDGALPYWDWTSPLNHLPELADHEKYVD 
PEDGVEKHNPWFDGHIDTVDKTTTRSVQNKLFEQPEFGHYTSIAKQVLLALEQDNFCDFEI 
QYEIAHNYIHALVGGAQPYGMASLRYTAFDPLFYLHHSNTDRIWAIWQALQKYRGKPYNVA 
NCAVTSMREPLQPFGLSANINTDHVTKEHSVPFNVFDYKTNFNYEYDTLEFNGLSISQLNK 
KLEAIKSQDRFFAGFLLSGFKKSSLVKFNICTDSSNCHPAGEFYLLGDENEMPWAYDRVFK 
YDITEKLHDLKLHAEDHFYIDYEVFDLKPASLGKDLFKQPSVIHEPRI 



Domain d 

GHHEGEVYQAEVTSANRIRKNIENLSLGELESLRAAFLEIENDGTYESIAKFHGSPGLCQL 
NGNPISCCVHGMPTFPHWHRLYVVVVENALLKKGSSVAVPYWDWTKRIEHLPHLISDATYY
NSRQHHYETNPFHHGKITHENEITTRDPKDSLFHSDYFYEQVLYALEQDNFCDFEIQLEIL 
HNALHSLLGGKGKYSMSNLDYAAFDPVFFLHHATTDRIWAIWQDLQRFRKRPYREANCAIQ 
LMHTPLQPFDKSDNNDEATKTHATPHDGFEYQNSFGYAYDNLELNHYSIPQLDKMLQERKR 
HDRVFAGFLLHNIGTSADGHVFVCLPTGEHTKDCSHEAGMFSILGGQTEMSFVFDRLYKLD 
ITKALKKNGVHLQGDFDLEIEITAVNGSHLDSHVIHSPTILFEAG



Domain e 

DSAHTDDGHTEPVMIRKDITQLDKRQQLSLVKALESMKADHSSDGFQAIASFHALPPLCPS
PAASKRFACCVHGMPTFPQWHRLYTVQFQDSLRKHGAWGLPYWDWTLPR 

Figure 10



KLH2 cDNA sequence and intron structure 



Domain b 

GGCCTGCCCTACTGGGATTGGACCATGCCAAtGAGTCATTTGCCAGAACTGGCTACAAGTG 
AGACCTACCTCGATCCAGTTACTGGGGAAACTAAAAACAACCCTTTCCATCACGCCCAAGT 
GGCGTTTGAAAATGGTGTAACAAGCAGGAATCCTGATGCCAAACTTTTTATGAAACCAACT 
T AC G GAG AC C AC AC T T AC C T C T T C GAC AG CAT GAT C T AC G C AT T T GAG C AG G AAG AC T T C T 
GCGACTTTGAAGTCCAATATGAGCTCACGCATAATGCAATACATGCATGGGTTGGAGGCAG 
TGAAAAGTATTCAATGTCTTCTCTTCACtacacTGCTTTTGATCCTATATTTTACCTCCAT 
CACTCAAAT'GTTGATCGTCTCTGGGCCATTTGGCAAGCTCTTCAAATCAGGAGAGGCAAGT 
CTTACAAGGCCCACTGCGCCTCGTCTCAAGAAAGAGAACCAT TAAAGCCTTTTGCATTCAG 
T T C C C C AC T G AAC AAC AAC G AG AAAAC G T AC C AC AAC TCTGTCCC C AC T AAC G T T TAT GAC 
TATGTGGGAGTTTTGCACTATCGATATGATGACCTTCAGTTTGGCGGTATGACCATGTCAG 
AAC T T GAG G AAT AT AT T C AC AAG C AG AC AC AAC AT GAT AG AAC C T T T G C AG GAT T C T T C C T 
TTCATATATTGGAACATCAGCAAGCGTAGATATCTTCATCAATCGAGAAGGTCATGATAAA 
TACAAAGTGGGAAGTTTTGTAGTACTTGGTGGATCCAAAGAAATGAAATGGGGCTTTGATA 
G AAT G T AC AAG TAT GAG AT C AC T GAG G C T C T G AAG AC G C T G AAT G T T G C AG T G GAT GAT G G 
GTTCAGCATTACTGTTGAGATCACCGATGTTGATGGATCTCCCCCATCTGCAGATCTCATT 
CCACCTCCTGCTATAATCtTTGaACGTGGTCaTG 

Intron 2b/c 

AGGTATTTAAAAAAGTAATAAAACCaTA.TTTTCGAATGCGCTTTATGAAATATCGTGTGAC 
TGGTTCTT TAG T T T AC AT G GAG T G T AAC AAC AT G C T C CAT C AG T T GAC AT AT AC T G C T C AC 
AC AAAG T AAG G GAT AT T T GAT AAT G AT AAC AAAT AT AAT C AAAG C G G T TAT AC TAT C AAG A 
CTTATTCACATAATTACAGGTGAAGGGAGGTGTGATCGTGTTCACTGATCAGGTTGAGGCC 
AGAGAAGTCCCAGTTTGAGTGTTGCAGAAGATGATGTTTAGGCA.TGGGGTCGAATCACCAA 
AAT C AC AT GAC T T C AAT AAC G G G T T G GAC C AC C T C GAG C GAC g AT G C AAG C A G TAG AG C G T 
CTACGCATGCTCCTGATAAGGCGACCAATCTGTTCCTGGGGAATCAGtCGCCACTCCTCTT 
GTAGTGCCACGCTCATTTCTGCTACGGTCCTGGGTACCTGCTATCGGgTCTTGATCCGTAT 
CCCAAGGATGTCCCACACATGTTCAAgGTGAGAGGTCGGGGAACATCGCTGGCCACGGTaA 
GGtCTGAATTTGATGCCGTTGAAAGTGAGCTCTGACAACcTGAGCATGGtGAGCTCTGACG 
TTGTCGTCCTGAAAGATGAATcCAGCTcCaTGaCAGCGAGCAAaGGGCAGGACGTGTTGGT 
CAATGCAGTTGTCTCTGCAGTAGACACCTGTCACTCGCCACTCACAAGCGTGTAGATCTGT 
AC GAC C AG T CAT G GAG AT C C C AG C C C AC AT CAT AAC G GAC C C C TAT C CA.T AC C GAT CAT GA 
GCCACCATAGCAGCGTCTTGATGACGTTCTCCCTGTCGCCTCGACATCCTcACACGGCCAA 
AAGGAACGTGGACTCGTCACTGAACATGACATTAGCCAACCTGGCACTTGTCCACCGCTGA 
TGTTGGCGAGACCATTCCAGTCGAGCTCTTCGGTGTCTGGCTTTCATCGATAACACGACGT 
AAGGTCTGCGGGCGTGCAAGACGGCTCTATGCAGGCGATTTCGGATTGTCTGGGTGCTAAC 
TCTGATCCCAGGTGCCTGCTGAAGTTGATGCTGGATCTGTGTGGCATTGAGATGGCGATTC 
CTTAGGACTGTGGAGATGA.TGAATCGATCTTGACTTATGGTGGTGACATTAGGACGTCGGG 
TTCGTGTCCTATCCTGCACTCTTCCAGTTGTTCGGTGACGCTCTGGTACCCGGCTGATTAC 
TGACTGAGAATATCCATCTGCCGTGCGACATGAGCCTGTGTTGGCCCAGCCTGAAGCATTG 
C AAT C G C C AG AG AX G C T C T T C AAAAG T CAT T C GAC G CAT GGtTTTCTGTT C AC AAAT GAC A 
GCGTAAAACAGtTTTTGGtGCTTTTATGCTTCCCAAGAGCATGAAAAACACGTTCTATgGG 
TCGtGCACACCTTACAT GACAAGtGtGAAAAGtGACTTGcACCCCCTTGtGtGTTCGGATG 
CACACTCTGTTTACGTACTGATGCGATTTGGCGTCTAAACATGTTTTGGCGTCTAAACATG 
TTTTCCTGCATGATTCATATACTATTTTGTCATATTCCTGGCATCAAACCAAACTACAGTG 
AAAT AT AT T T C AA TAT C C C C T AC T T T G T G T GAG TAG TAT AG AT C AC T G C AG AC AAC AT AT A 



G AC AAt G C Ag tTaCaCCGT C AAC AAT C C C AG T CAT T AAT TAT GAT GaCaCTTC C AC AC AT A
G T G T C AG T GAT T G T AAT T C Aa C T G T AC AC AC TTTTCCCGT G AAC A.T T C AG GAT C T AJT AT G A 
C T AAAT AT AT AAC AT TAG TAT AC G T G C AG T T T T G TAT C G C T AC G AC AT T G T T G T AAC T C T T 
TGTTTAATCATTTaACAG 



Domain c 

CTGATGCCAAAGaCTTTGgCCATAGCAGAAAAATCAGgAAAGcCGTTGATTcTcTGACAGT 
C g AAG AAC AAAc T T C G T T GAG g C GAG c TAT G g C AG AT c T AC A.G G AC G AC AAAAC AT C AG G G 
GGTTTCCAGCAGATTGCAGCATTCCACGGAGAACCAAAATGGTGTCCAAGCCCCgAAGCGG 
AGAAAAAATTTGCATGCTGTGTTCATGGAATGGCTGTTTTCCCTCACTGGCACAGATTGCT 
GACAGTTCAAGGAGAAAATGCTCTGAGGAAACATGGCTTTACTGGTGGACTGCCCTACTGG 
GACTGGACTCGATCAATGAGCGCCCTTCCACATTTTGTTGCTGATCCTACTTACAATGATG 
C TAT T T C C AG C C AGG AAGAAGA-T AAC C CAT G G CAT CAT G G T C AC AT AGAC TCTGTTGGG C A 
TGATACTACAAGAGATGTGCGTGATGATCTTTATCAATCTCCTGGTTTCGGTCACTACACA 
GATATTGCACAACAAGTCCTTCTGGCCTTTGAGCAGGACAGTTTCTGTGATTTTGAGGTAC 
AAT T T G AAAT T G C C CAT AAT T T CAT AC AT G C AC T G A.T T G G T G G T AAC G AAC CAT AC AG TAT 
G T CAT C T T T GAG G T A.T AC T AC AT AC GAT C C AAT CTTCTTCTTG C AC C AC T C C AG T AC AG AC 
CGACTTTGGGCCATCTGGCAAGCAATCAXTAGTGCGGCCGCCTGCAGGTCGACCATAAGGG 
AGAGCTCCCAACGCgtTGGAtGCAATCT 



Domain g 

ATGGCTGTGTTTCCGCACTGGCACAGACTGTTTGTGAAACAGATGGAGGACGCACTTGCTG 
CTCATGGAGCTCATATTGGCA.TACCATACTGGGA.TTGGACAAGTGCGTTTAGTCATCTGCC 
CGCCCTAGTGACTGACCACGAGAACAATCCCTTCCACCAC 

Intron g (2) 

GTATGTGTCAAATCGTTTTAGGAACTGCCTTATCCATTTTACAATTACGAGTACAAAATGA 
AAAC G G AAAC T G T G T G AC C T C G AAAAG T G C AAT C T T T AAAG GAT G C AAT G T AC AC AAT AAA 
ATGCTCCGA.TCAAAAGCGATGGCTAGAAATCATTTTCCCTCTAATTCCCTTTCACACAGCT 
CGGTTCGTTT T AAG TAG G AAC AAG T C T C T G C AAAAAC AT C AC AAAT A*AAG AG AAC AC AG AA 
AAAACCTCATTCTCGTTTCTGTATTCCGAAAATGAAATTTACAATTTCTTTCATTTATAG 



Domain g 

GGCCATATTGGTCATCTGAATGTGGATACATCTCGATCTCCAAGAGACATGCTGTTTAATG 
AT C C T G AAC AAG G C T C AG AAT CAT T C T T C T AC AG AC AG GTTCTCTT G AC T C TAG AAC AG AC 
AGACTTCTGCCAATTTGAAGTTCAGTTTGAACTTACACACAATGCCATCCACTCTTGGACT . 
G GAG G AC AT AC T C CAT AT G G AAT G T CAT C AC T G G AAT AJT AC AG CAT AT GAT C C AC T C T T T T ' 
AT C T C C AC CAT T C C AAC AC T GAT C G TAT C T G G G C C A.T C T G G C AG G C AC T C C AG AAAT A.T A-G 
AGGTCTTCCATACAACGCAGCTCACTGCGATATccaagttctgaaacaacct cTTAAACCA 
TTCAGCGAGTCCAGGAATCCAAACCCAGTCACCAGAGCCAATTCTAGGGCCGTTGATTCAT 
T T GAT TAT GAG AAAT T C AAT TAT C AAT AT G AC AC AC T T AC CTTCCACGGACTTTC TAT C C C 
AG AAC T T GA.T G C CAT G C T T C AAG AG AG AAAG AAG G AAGAG AG AAC AT T T G C AG CCTTCCTG 
TTGCACGGATTTGGCGCCAGTGCTGATGTTTCGTTTGATGTCTGCACACCTGATGGTCATT 
GTGCCTTTGCTGGAACCTTCGCGGTACTTGGTGGGGAGCTTGAGATGCCCTGGTCCTTTGA 
AAG AT TGTTCCGT T AC GAT AT C AC AAAG G T T C T C AAG C AG AT G AAT C T T C AC TAT GAT T C T 
GAG T T C C AC T T T G A.G T T G AAG AT T G T T G G C AC AG AT G G AAC AG AAC T G C CAT C G GAT C G T A 
T C AAG AG C C C T AC C A.T T G AAC AC CAT G GAG GAG 

Intron g/h

GTATGTTTTGAGATCCACATAATCTTCTACCCTGTCTCATTTCTAATGCTCTTCAATACAC 
AAT T T AT AT AGC C T T T GAG C T T C AG AT G TAT T AC G GAC AG G CAT T AC AG TAT AC AT G T AAT 
ATGGTTTTCTGCTATTTGCAAAAATTGTGTCCTATCTCTGTTCAGATCATCATGGCGGTGA 

CACCTAG 



Domain h 

GTCACGATCACAGTGAACGTCACGATGGATTTTTCAGGAAGGAAGTCGGTTCCCTGTCCCT 
G GAT G AAG C C AAT GAC C T T AAAAAT G C AC T G T A C AAG C T G C AG AAT GAT C AG G G T C C C AAT 
GGATATGAATCAATAGCCGGTTACCATGGCTATCCATTCCTCTGCCCTGAACATGGTGAAG 
ACCAGTACGCATGCTGTGTCCACGGAATGCCTGTATTTCCACATTGGCACAGACTTCATAC 
AATCCAGTTTGAGAGAGCTCTCAAAGAACATGGTTCTCATTTGGGTCTGCCATACTGGGAC 
TGGAC 



Figure 11



Derived primary structure of KLH2 



Domain b 

GLPYWDWTMPMSHLPELATSETYLDPVTGETKNNPFHHAQVAFENGVTSRNPDAKLFMKPT
YGDHTYLFDSMIYAFEQEDFCDFEVQYELTHNAIHAWVGGSEKYSMSSLHYTAFDPIFYLH 
HSNVDRLWAIWQALQIRRGKSYKAHCASSQEREPLKPFAFSSPLNNNEKTYHNSVPTNVYD
YVGVLHYRYDDLQFGGMTMSELEEYIHKQTQHDRTFAGFFLSYIGTSASVDIFINREGHDK
YKVGSFVVLGGSKEMKWGFDRMYKYEITEALKTLNVAVDDGFSITVEITDVDGSPPSADLI
PPPAIIFERGHA 



Domain c 

DAKDFGHSRKIRKAVDSLTVEEQTSLRRAMADLQDDKTSGGFQQIAAFHGEPKWCPSPEAE
KKFACCVHGMAVFPHWHRLLTVQGENALRKHGFTGGLPYWDWTRSMSALPHFVADPTYNDA 
ISSQEEDNPWHHGHIDSVGHDTTRDVRDDLYQSPGFGHYTDIAQQVLLAFEQDSFCDFEVQ 
FEIAHNFIHALIGGNEPYSMSSLRYTTYDPIFFLHHSSTDRLWAIWQALQKYRGKPYNTAN 
CAIASMRKPLQPFGLDSVINPDDETREKSVPFRVFDYKNNFDYEYESLAFNGLSIAQLDRE 
LQRRKSHDRVFAGFLLHEIGQSAKHNVSDCDKYAGEFYILGDEAEMPWRYDRVYKYEITQQ 
LHDLDLHVGDNFFLKYEAFDLNGGSLGGSIFSQPSVIFEPAAGMF 



Domain d 

GSHQADEYREAVTSASHIRKNIRDLSEGEIESIRSAFLQIQKEGIYENIAKFHGKPGLCEH 
DGHPVACCVHGMPTFPHWHRLYVXQVENALLERGSAVAVPYWDWTLPR 



Domain g 

MAVFPHWHRLFVKQMEDALAAHGAHIGIPYWDWTSAFSHLPALVTDHENNPFHHGHIGHLN

VDTSRSPRDMLFNDPEQGSESFFYRQVLLTLEQTDFCQFEVQFELTHNAIHSWTGGHTPYG

MSSLEYTAYDPLFYLHHSNTDRIWAIWQALQKYRGLPYNAAHCDIQVLKQPLKPFSESRNP 

NPVTRANSRAVDSFDYEKFNYQYDTLTFHGLSIPELDAMLQERKKEERTFAAFLLHGFGAS 

ADVSFDVCTPDGHCAFAGTFAVLGGELEMPWSFERLFRYDITKVLKQMNLHYDSEFHFELK 

IVGTDGTELPSDRIKSPTIEHHGG 



Domain h 

GHDHSERHDGFFRKEVGSLSLDEANDLKNALYKLQNDQGPNGYESIAGYHGYPFLCPEHGE
DQYACCVHGMPVFPHWHRLHTIQFERALKEHGSHLGLPYWDW 



